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Introduction 


The aim of this book, which originates from a course that we taught suc- 
cessively at Ecole Normale Supérieure (FL in 2007-2010 and MH in 2010- 
2013), is to present a broad panorama of Mathematical Logic to students 
who feel curious about this field but have no intent to specialize in it. As 
a consequence we have deliberately chosen not to write another com- 
prehensive textbook, of which there already exist quite a few excellent 
ones, but instead to deliver a slim text which provides direct routes to 
some significant results of general interest. 


Our point of view is to treat Logic on an equal footing to any other 
topic in the mathematical curriculum. Since one does not have to define 
natural numbers when teaching Number Theory, or sets when teaching 
Analysis, why should we in a Logic course? For this reason we start the 
book with a presentation of naive Set Theory, that is, the theory of sets 
that mathematicians use on a daily basis. It is only in the last chapter 
that we discuss the Zermelo-Fraenkel axioms, which in fact most math- 
ematicians who are not Set Theorists or teaching a logic course are not 
so familiar with. 


In each chapter we have tried to present at least a few juicy high- 
lights, outside Logic whenever possible, either in the main text, or as ex- 
ercises or appendices. We consider exercises as an essential component 
of the book, and we encourage the reader to work them out thoroughly; 


ix 


Licensed to Mathematisches Forschungsinstitut. Prepared on Thu Sep 30 04:18:29 EDT 2021 for download from IP 188.1.230.74. 
License or copyright restrictions may apply to redistribution; see https://www.ams.org/publications/ebooks/terms 
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they should be seen not only as a tool to check that the course is correctly 
assimilated, but also as a way to provide an opening to additional topics 
of interest. 


The book is organized as follows. In the first chapter, in addition to 
the basic theory of ordinal and cardinal numbers, we cover more exotic 
topics like Goodstein sequences, infinite combinatorics (clubs and Solo- 
vay’s Theorem) and Hindman’s Theorem (a striking result in additive 
combinatorics). In Chapter 2] we introduce First-order Logic and for- 
mal proofs. We prove Gédel Completeness via Henkin witnesses. Craig 
Interpolation and Beth Definability are treated in exercises. The next 
chapter delves deeper inside Model Theory, with detailed coverage of 
Quantifier Elimination. In particular we prove Quantifier Elimination 
for algebraically closed fields, which allows one to state and prove the 
Lefschetz Principle in Algebraic Geometry and Ax’s Theorem on surjec- 
tivity of injective polynomial mappings. Chapter [jis devoted to basic Re- 
cursion Theory and culminates with the existence of universal recursive 
functions, undecidability of the Halting Problem and Rice’s Theorem. 
In Chapter [5] we prove the classical undecidability and incompleteness 
results of Tarski and Church and provide a complete proof of Gédel’s Sec- 
ond Incompleteness Theorem which we found in Martin Ziegler’s book 
[13]. We also present, as an exercise, a theorem of Tennenbaum about 
the inexistence of non-standard countable recursive models of Peano. 
Finally in Chapter 6] we develop Axiomatic Set Theory, including the 
Reflection Principle and some proofs of independence and relative con- 
sistency. 


This book is intended towards advanced undergraduate students, 
graduate students at any stage, or working mathematicians, who seek a 
first exposure to core material of mathematical logic and some of its ap- 
plications. Prerequisites are minimal: besides familiarity with abstract 
reasoning and basic mathematical concepts, some acquaintance with 
General Topology and Algebra, especially Field Theory, is required at 
various places. 


For the interested reader, here are a few suggestions for further read- 
ing, providing more comprehensive and advanced material, ordered by 
increasing difficulty: 


Model Theory: The books by Marker [8], Poizat [9] and Tent-Ziegler [[12]]. 
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Set Theory: The books by Krivine [6], Kunen [7] and Jech [5]. 
Recursion Theory: The books by Cooper [i], Rogers [[10] and Soare [/I1]]. 
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Chapter 1 


Counting to Infinity 


Introduction 


The general purpose of this chapter is to provide tools for comparing 
sizes of infinities. To this aim we develop in [I-3H1.4the notion of ordinals 
which constitute a natural infinite extension of natural numbers. Ordi- 
nals classify well-ordered sets and are a natural device for using transfi- 
nite induction. A fundamental and very useful fact is that ordinals are 
endowed with arithmetic operations extending those of natural num- 
bers, even though some classical properties do not extend (for instance 
addition of ordinals is no longer commutative). 


Building on the notion of ordinals, we develop in the con- 
cept of cardinals, which is the right notion to compare the size of sets. 
Unlike the case of ordinals, one needs to assume the validity of the Ax- 
iom of Choice (which we discuss in [I.7) to develop a full fledged the- 
ory of cardinals. In the last sections of this chapter, we study cardinal 
arithmetic, which appears to have a much richer theory of exponenta- 
tion than ordinal arithmetic. 


1.1. Naive Set Theory 


In this chapter we shall use the notions of set and natural number in 
the same way as in any mathematical textbook, that is, in a naive sense, 
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2 1. Counting to Infinity 


without further questioning. It is only in the last chapter that we shall 
introduce the classical ZFC system of axioms of Zermelo-Fraenkel (plus 
the Axiom of Choice). We shall see that the notions and results of this 
first chapter remain valid in the more formal axiomatic Set Theory, using 
only the axioms of ZFC. We shall thus develop Cantor’s theory of ordinal 
and cardinal numbers from this naive point of view. 


When A and Bare sets, we denote by AUB, ANB and by A\B their set- 
theoretic union, intersection and difference, respectively. More generally, 
if is a set and (A;)jc; is a family of sets indexed by I, we denote its union 
by U,-; Ai and its intersection by (),_, Ai; thus we have x € U,_, Ai if 
and only x € A; forsomei € Iandx € Nie Ai if and only x € A; for 
every i € I. 

We write A C Bif Ais a subset of B,and A C BifA is a proper subset 
of B. The power set of A is denoted by P(A). It is the set of subsets of A; 
thus C € P(A) if and onlyif C C A. 

We denote by N the set {0, 1, 2, ...} of natural numbers, and we write 
N* for N \ {0}. 

We will make constant use of the extensionality principle according 
to which two sets containing the same elements are equal. 


We shall also make use of the comprehension principle which states 
that given a set A and a property P of sets, there exists a set whose ele- 
ments are exactly those elements of A that satisfy property P. We defer 
to Section 6.2 for a more precise formulation. 


1.2. The Cantor and Cantor-Bernstein Theorems 


The existence of an injective, surjective or bijective function between 
two sets may be seen as a way to compare their “size”. We start with two 
results going in that direction. 


Theorem 1.2.1 (Cantor). Let A be aset. There is no surjection A > P(A). 


Proof. Let f : A — P(A) bea map. Consider the set 
B={xEA|xE€ fx}. 


For every x € A with f(x) = B, we have x € Bifand only ifx ¢ B. 
Hence B does not belong to the image of f. 
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1.3. Orders 3 


Theorem 1.2.2 (Cantor-Bernstein). Let A and B be sets, and let f : A> 
Band g : B > Abe injective maps. Then there exists a bijectionh : B > 
A. 


Proof. We may assume A is a subset of B and f is the inclusion map. 
Indeed, one may replace A by f(A) and g by f og. Nowset C = {g"(x) | 
neEN, x €B\A}. Definea maph : B > A by h(c) = g(c) whenc EC 
and h(x) = x when x € B\ C. The map h is surjective: indeed, any 
x € ANC is of the form x = g(y) for some y € C and for any x €A\C, 
x = h(x). It is also clearly injective. 


Definition. Let X and Y be sets. One says that X and Y are equinumer- 
ous, and writes X ~ Y, if there exists a bijection between X and Y; one 
says X is subnumerous to Y, and writes X < Y, if there exists an injection 
xX-y. 


Using this terminology, the Cantor-Bernstein Theorem may be re- 
stated as: ifX < Y and Y <X, then X ~ Y. 


1.3. Orders 


This section and the next one are devoted to preliminary results on or- 
dered sets that are needed to develop the theory of ordinals. 


Definition. A partial order < on a set X is a binary relation (that is, 
given by a subset of X x X) which is transitive (if x < y and y < z then 
x < z)and antireflexive (x < x). If furthermore, for every x, y € X one 
has x < y,x =yory < x, one Says < is a total order. 


One writes x < ytomeanx < yorx=y,x > yfory<x,andx>y 
fory <x. 


If Y CX, y € Y isa smallest element if for every y’ in Y, y < y’. It 
is a minimal element if for every y’ in Y, y’ ¢ y. One similarly defines a 
largest element and a maximal element. A lower bound of Y is an element 
of X which is < all elements in Y. An infimum of Y is a largest element 
in the set of lower bounds of Y. One similarly defines the notion of upper 
bound and supremum. 
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4 1. Counting to Infinity 


Note that in a partial order, ifa < band b < athena = b. Thus 
smallest and largest elements are unique, which is in general not the 
case for minimal or maximal elements. 


Remark 1.3.1. If < is a partial order, < is a reflexive relation (x < x for 
every x € X), transitive and antisymmetric (if x < y and y < x then 
x=y). 

Conversely, if < is a binary relation on a set X which is reflexive, 
transitive and antisymmetric then the relation < defined by x < y:@ 
(x < yand x # y) isa partial order on X. 


Proof. Exercise. 


Definition. 
(1) Let < be a partial order on X. We say < is well-founded if any 
non-empty subset of X contains a minimal element. 


(2) A well-order is a well-founded total order. 
Remark 1.3.2. Let < be a partial order on X. 


(1) The mapa X.q = {x © X | x < a} identifies (X, <) with 
a subset Y of P(X) endowed with the partial order induced by 
ic. 


(2) < is well-founded if and only if there is no infinite decreasing 
sequence in X. 


(3) < is a well-order if and only if every non-empty subset of X 
contains a smallest element. 


Proof. The proofs of (1) and (3) are left as exercises. 


Let us prove (2). If (X, <) is not well-founded, there exists a subset 
® # Y C X without minimal element. By induction on n € N one 
may construct y, € Y such that yji1 < yy. Conversely, if Wy)nen is a 
decreasing sequence, it is clear that Y = {y, | n € N} does not contain 
a minimal element. 


Example 1.3.3. 


(1) The usual order < on Z is a non-well-founded total order. Its 
restriction to N is a well-order. 
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1.4. Operations on Orders 5 


(2) For every set X, the relation C is a partial order on P(X) which 
is well-founded if and only if X is finite 


(3) The restriction of a partial order (resp. total, well-founded) on 
X to Y CX isa partial order (resp. total, well-founded) on Y. 


1.4. Operations on Orders 


Definition. Let X and Y be partially ordered sets. 


(1) The ordered sum of X and Y, denoted by X + Y, is the partially 
ordered set consisting of pairs (x, 0) with x € X and (y, 1) with 
y € Y, the order being defined as follows: (a, i) < (b, j) ifi < j 
or ifi = janda< b. 

(2) The reverse lexicographic product of X and Y is defined by en- 
dowing the cartesian product X x Y with the order: (x,y) < 
(x’,y')ify < y' orify = y’ andx < x’. It is still denoted by 
XXY. 


By an isomorphism between two partially ordered sets X and Y we 
mean a bijection f between X and Y such that for any x, x’ € X one has 
x <x’ if and only if f(x) < f(x’). 


Lemma 1.4.1. 
(1) The ordered sum of total orders (resp. well-founded partial or- 
ders) is a total order (resp. well-founded). 


(2) The reverse lexicographic product of two total orders (resp. well- 
founded partial orders) is a total order (resp. well-founded). 
(3) LetX, Y and Z be partially ordered. We have the following canon- 
ical isomorphisms of partially ordered sets: 
(a) (X+Y)4+Z2X4+(V+Z). 
(b) (XX Y)xZ=Xx(Y XZ). 
(c) XX(Y+Z)=(X x Y)+(X x Z). 


Proof. The only non-trivial point to check is that the reverse lexico- 
graphic product of two well-founded partially ordered sets is well-founded. 


lBy a finite set we mean a set into which N does not inject. 
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6 1. Counting to Infinity 


Let X and Y be two well-founded partially ordered sets. Let Z be a non- 
empty subset of XxY. We denote by z : XxY — Y the projection on the 
second factor. The order on Y being well-founded, there exists a minimal 
element yp in z(Z) C Y. Since the order on X is well-founded, there is a 
minimal element x in the (non-empty) set Z,, = {x € X | (xX, yo) € Z}. 
It is clear that (x9, yo) is minimal in Z. 


Definition. Let X and Y be totally ordered sets. We assume that X ad- 
mits a smallest element 0. One defines the partially ordered set X as 
follows. As a set, it is the set of functions from Y to X with finite sup- 
port, that is, the subset of the set X Y of all functions Y ~ X consisting 
of functions f : Y > X such that 


supp(f) :={y EY | f(y) #0} 


is finite. One sets f < gif there exists y € Y such that f(y) < g(y) and 
fQ") = 80”) for every y’ > y. 


Proposition 1.4.2. Let X,Y and Z be totally ordered sets, and assume 
that X admits a smallest element 0. 


(1) The relation < defines a total order on X‘*) which is well-founded 
when the orders on X and Y are both well-founded. 


(2) There are canonical isomorphisms of totally ordered setsX‘¥) = 
XM) x X@ and XU*2) (xm), 


Proof. The only non-trivial point to check is that if X and Y are well- 
ordered, then X’ (Y) is well-founded. Let Z bea non-empty subset of X’ (Y), 
Let us prove that Z contains a smallest element. If the constant function 
with value 0 belongs to Z, there is nothing to prove. Hence, we may 
assume supp(f) 4 @ for every f € Z. Let 


Y=ti(/) | f € 2}, 


where s,(f) = max(supp(f)). Let y, be the smallest element of Y,, and 
set Z; ={f €Z | (f) = yy}. The set Z; is an initial segment of Z, in 
other words f < gforevery f € Z, and g € Z\Z;. Let x, be the smallest 
element of {f(y,) | f € Z)}. We set 


Z,={f ©€Z, | fOr.) = x4}- 
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1.5. Ordinal Numbers 7 


The set Z, is an initial segment of Z}. If Z, contains the function with 
constant value 0 outside {y,}, we are done. Otherwise, we have supp(f)\ 
{y,} # @ for every f € Z,. Let Y; = {s.(f) | f © Z}, where s2(f) = 
max(supp(f) \ {y,}). Let y, be the smallest element of Y,, and x, the 
smallest element of {f(y.) | f € Z, andy, = s)(f)}. We set Z, = 
{f €Z, | s.(f) = yz and f(y) = x}. It is an initial segment of Z,. If Z, 
contains the function with constant value 0 outside {y,, y.}, we are done, 
otherwise one continues in the same way, constructing Y3, y3, 23, x3, Z3 
and so on. Since the sequence ();) is strictly decreasing in Y, this process 
stops after a finite number of steps. 


1.5. Ordinal Numbers 


A set X is said to be transitive if for allx € X and y € x one hasy € X. 
This is equivalent toxE xX>xCx. 


Definition. A set X is an ordinal if it is transitive and if the relation 
{(x,y) EX xX | x € y} on X defines a well-order on X. 


Proposition 1.5.1. Let a and £ be ordinals. 


(1) @is an ordinal. 

(2) Ifa#@, then®ea. 

(3) aéa. 

(4) Ifx Ea, thenx = S., :={fyea | y< x}. 
(5) If x € a, then x is an ordinal. 

(6) 8 Caifand only ifB € aorB =a. 

(7) x := aU {a} is an ordinal, denoted by at. 


Proof. (1) is clear. For (2), one considers x € a minimal. If y € x, 
then y € a by transitivity of w, and x would not be minimal. In (3), by 
antireflexivity, we have x ¢ x for every x € a. Thusa@ € a implies 
a € a. (4) follows from the fact that < is given by €. To prove (5), note 
that € restricts to a well-order on x, since x C a. Furthermore, x = S., 
is transitive, sincez €yExSz<x>BZeES.y,. 


To prove the ‘only if’ part in (6), let us assume that 8 C a. Let x 
be minimal in a \ f. Clearly 6 2 S., by minimality. Furthermore, if 
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8 1. Counting to Infinity 


y € f, then y € x since otherwise x € y and x € f. Hence 6 = S., = 
x € a. The other implication in (6) is clear, and the verification of (7) is 
immediate. 


Proposition 1.5.2. Let X bea non-empty set of ordinals. Then Oe ex “is 
a smallest element of X. 


Proof. The intersection ofa family of transitive sets is transitive, and the 
restriction of a well-order to a subset is a well-order. Hence 6 = (),-y & 
is an ordinal. We have 6 C a for everya € X. If6 ¢ X, thenf Ea 
for every a € X, by Proposition [I.5.1(6). It follows that 8 € 8, which is 
absurd. 


Theorem 1.5.3. Let a and £ be ordinals. Exactly one of the following 
properties holds: 


(1) ae B, (2)a=8, (3)B Ea. 


Proof. One sets X = {a, 6}, and one applies Proposition [1.5.2, Ifan6 = 
a, then a C f, hence a = £ ora € £ by Proposition [I.5.1(6). Similarly, 
ifanf = £, thena = for € a. The fact that these properties are 
mutually exclusive follows from the axioms of a partial order. 


Notation. From now on, we shall write a < 6 fora € 6, anda < # for 
a C 6, when a and £ are ordinals. 


Proposition 1.5.4. Let X be a set of ordinals. Then b = ee ex % isan 
ordinal. Furthermore, if y is an ordinal with y < b, there exists a € X 
such thaty € a. We shall also write b = sup,-x &. 


Proof. The set b being the union of transitive sets, it is transitive. Fur- 
thermore, b contains only ordinals. By Theorem € induces a total 
order on b. If 6 4 Z C b, then (leew a is a smallest element of Z by 
Proposition This shows that the order given by € on b is well- 
founded. 


An ordinal of the form at is called a successor ordinal. It is clear 
that at is the smallest ordinal > a. 


Definition. A limit ordinal is a non-empty ordinal which is not a suc- 
cessor. 
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Proposition 1.5.5. For an ordinal A # 9, the following conditions are 
equivalent: 

(1) Aisa limit ordinal; 

(2) A= Ue & 


Proof. (1)>(2). Let 6 = ea aand/a limit. It is clear that 6 C A. 
Conversely, assume a < 2. Then at < Aand it follows that at < A since 
Aisa limit ordinal. The statement follows, since a € at C B. 


(2)>(1). Ifa = y*, then Ua = Uaey % =y<a. 


Example 1.5.6. 


(1) One can recover the natural numbers as ordinals as follows. 
One sets 0 := Q, and inductively n+1 := nt forn EN. 
For instance 1 = {O}, 2 = {0,1} = {@,{O}}, 3 = {0,1,2} = 
(9, {D}, 19, {}}}. 

One proves by induction that n is an ordinal for every nat- 
ural number n. We shall often identify n and n. 


(2) One sets w :=)_,.n. It is an ordinal by Proposition [1.5.4. 


neN — 


Definition. One says that an ordinal is finite if it is not a limit and none 
of its elements is a limit. 


Proposition 1.5.7. 

(1) wis the set of finite ordinals. 

(2) w is the smallest limit ordinal. 
Proof. One proves first, by induction on n € N, that all elements of w 
are finite ordinals. Furthermore, a < w implies a+ < w. This proves (2). 


Ifa € w, then w < a, so either a = w or w € a. In both cases, a@ is not 
finite. This proves (1). 


Lemma 1.5.8. Let f : a — a’ be astrictly increasing map between two 
ordinals. Then f($) > 6 for every B € a. In particular, a < a’, and if f is 
an isomorphism of ordered sets, then a = a’ and f is equal to the identity. 


Proof. If there exists 6 € a with f(6) < 8, we consider 6) minimal with 
that property. Since f is strictly increasing, we have f(f(69)) < f(8o), 
which contradicts minimality. 
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The statement about an isomorphism f follows by applying the re- 
sult to f as wellas to f—!. 


Theorem 1.5.9 (Classification of well-orders by ordinals). Every well- 
ordered set X is isomorphic, as an ordered set, to some ordinal. Further- 
more, the ordinal and the isomorphism are both unique. 


Proof. Uniqueness follows from Lemma To prove existence, let 
us first note that for every x € X, any isomorphism between S., and an 
ordinal a can be extended to an isomorphism between S., = Sz, U {x} 
and at. Let 


Y={y EX | there exists f : Sc, = a for some ordinal a}. 


By uniqueness, for y € Y, the ordinal a = a(y) and the isomorphism 
f = fy are unique. Let us prove Y = X. Otherwise, there would exist 
x € X minimal in X \ Y. For y < x we have an isomorphism fy, : Sz, = 
a(y). Furthermore, these isomorphisms form a coherent family in the 
sense that for every y’ < y < x we have f, [ i = fy. (To see this, note 
that an initial segment of an ordinal is an ordinal.) We set 

a =supa(y)and f : S., >a, f(y) := fy). 

y<x 

It is clear that f is well defined and induces an isomorphism of ordered 
sets between S., and a. By the observation made at the beginning, f 
may be extended to an isomorphism between S., and a*, which leads 
to a contradiction. So we have Y = X. To conclude, one uses the same 
kind of argument, setting a(X) := sup,.y a(x) and f : X = a(x), 
xr f(x). 


Remark 1.5.10 (Transfinite induction). Let P be a property of ordinals. 
One assumes: 


¢ @ satisfies P; 
« for every ordinal a: if a satisfies P, then a* satisfies P; 


* for every limit ordinal A: if every a < A satisfies P, then A sat- 
isfies P. 


Then every ordinal satisfies P. 
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1.6. Ordinal Arithmetic 


If a and £ are ordinals, by Theorem there is a unique ordinal iso- 
morphic to the ordered sum of a and §, which one denotes by a+. One 
similarly defines af as the unique ordinal isomorphic to the reverse lex- 
icographic product a x 6 and df as the unique ordinal isomorphic to the 
ordered set «“9), Note that 0° has still to be defined: one sets 0° := 1 
and 0f :=0 for every 6 > 0. 


Proposition 1.6.1 (Ordinal addition). Let a, 8 and y be ordinals. 


(1) a+0=0+a=a. 

(2) a+l=at. 

(3) a+ (6B +y) =(a+)+y, in particulara+ Bt =(a+)t. 

(4) a < £ if and only if there exists an ordinal 6 > O such that 
p=ato. 

(5) IfB < y, thena+ 6 <a+y for every a. In particular, one may 
simplify on the left: a+B=a+ty>fB=y. 

(6) Ifd isa limit, thena+A = supg_,(a + 8) (continuity). 


(7) 1+a=a+1 when cis finite, otherwise 1+ a= a. 


Proof. (1) and (2) are clear, and (3) follows from Lemma For the 
non-trivial implication in (4), one easily checks that the ordinal 6 iso- 
morphic to the well-ordered set 6 \ a does the job. 

(5)IfB < y, by(2) and (4) one has y = 6+6, hence at+y = (a+8)+6, 
for some 6 > 0. 

(6)a+A> SUP, j(a& + B) follows from (5). Conversely, suppose 
a<u<at+d. Thenu = a+ forsome 6 withO < 6 <A. Sincedisa 
limit, one has 6+ <A, hence <a+6t < SUPg <4(& + £). 

(7) One proves by induction onn € N thatl +n =n+1. By (6) 
we have 1+ @ = w. Finally, a > w can be written as a = w + §, hence 
l+a=1l+o+B=ot+6 =a. 


From now on, we shall allow the omission of parentheses, using the 
convention that exponentiation ties are stronger than multiplication and 
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that multiplication ties are stronger than addition. For instance, one 
should read af + y as (a8) + y, and ya? as y (a). 


Proposition 1.6.2 (Ordinal multiplication). Let a, 8, y be ordinals. 
(1) a0 = 0a = 0. 
(2) al=la=a. 
(3) a(By) = (aB)y. 
(4) a(B +y) = aB + ay, in particular aBt = aB + a. 
(5) 2w=w<o2=oW+4. 
(6) Assume a # 0. If 6 < y, then aB < ay. In particular, one may 
simplify on the left: a8 = ay > B =. 
(7) IfA is a limit ordinal, then aa = SUPg <a aB (continuity). 


Proof. (1) and (2) are clear, (3) and (4) follow from Lemma [1.4.1], For 
(6), it suffices to note that if 6 < y then y = 6 + 6 for some 6 > 0, hence 
ay = a6 + ad by (4) from which it follows that ay > a. 


(7) One may assume a # 0. Let A be a limit ordinal. The inequality 
ah > SUP <a afB =: 6 follows from (6). Conversely, let y < ad. Eu- 
clidean division, proved in the next lemma, provides a pair of ordinals 
(e, 4) such that y = aw +, with p < a. Since u < A by (6), we have 
bt <A because / isa limit ordinal, hence y = aut+p <aut+a=aput < 
6. In (5), 2a = w follows from (7), the other statements being clear. 


Lemma 1.6.3 (Euclidean division). Let a and 8 be ordinals, with a # 0. 
Then there exists a unique pair of ordinals (¢, 4) such that p < a and 


p=apto. 


Proof. Uniqueness: Assume a+p = ap’+p' withp,p’ <a. Ifu<w', 
then aut+p < aut <a’ < ap’ +p’, which is absurd. Hence p = pw’ by 
symmetry, and one obtains p = p’ after simplifying. 

Existence: When £ = 0 there is nothing to prove. Assume 6 + 0. 
The mapping fo : 8 > ax, x & (0,x) is strictly increasing, hence 6 < 
aB by Lemma [I.5.8| If 8 = af, one sets u = 6 and p = 0. Otherwise, we 
have 6 € af. Let f be the unique isomorphism of ordered sets between 
aB and ax. One sets (p, “) = f (8). Since S.¢p ,) = (aX) +p it follows 
that 6 = a+. 
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Note that we have only used properties (1)-(4) and (6) from Propo- 
sition in our proof of Euclidean division, thus avoiding circularity. 


Proposition 1.6.4 (Ordinal exponentiation). Let a, 6, y be ordinals. 


(1) For every a, we have x° = 1, a! = aand 1% = 1. If a # 0, then 
0% = 0. 


(2) aP+Y = eal, in particular ah* = oa, 
(3) (a®)" = oY, 
(4) Ifa>landB <y, thena® <a’. 


(5) If is a limit ordinal and a + 0, then a+ = SUPg <a a® (continu- 


ity). 


Proof. (1) is checked directly, and statements (2) and (3) follow from 
Proposition [1.4.2. 

(4)8 <y>y=£ +46 for some 6 > 0. Hence a’ = aft? = aha’, 
But a > 1 since asa set a“) contains at least two elements. It follows 
that a” > a by Proposition [I.6.2(6). 

Let us prove the non-trivial inequality in (5). Let f € «. One may 
assume f is not the constant function with value 0. Then s,(f) < A, and 
hence 8 = s,(f)+ < A, which proves there exists a strictly increasing 
function Scr > a‘). One concludes by Lemma 


Remark 1.6.5. The following formulas would allow us to define ordinal 
addition, multiplication and exponentiation by transfinite induction: 


*a+0=a,a+f* =(a+)*,anda+A = supg_,(a + 8) for 
A a limit ordinal. 


+ a0 = 0,a8* = aB+a,and ad = supg_4(a8) for A a limit 
ordinal. 


- Assume a # 0. Then one has a = 1, aB* = a8a, and at = 
SUP ga (a*) for A a limit ordinal. 
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1.7. The Axiom of Choice 


Given a family of sets (X;);-7, one defines their product as 


[pm=|rs2- Ux 


ieI ieI 


f@ €X; for alli cit. 


Definition. The Axiom of Choice (AC) states that the product of a family 
of non-empty sets is non-empty: if X; # @ for alli € I, then [],_, Xi # O. 

In the Zermelo-Fraenkel system of axioms ZF, (AC) is equivalent 
to Zorn’s Lemma and also to Zermelo’s Theorem. We shall prove these 
equivalences in the last chapter of this book, and accept them for the 
moment. 


Definition. A partially ordered set X is inductive if any totally ordered 
subset Y C X admits an upper bound in X. (In particular, such an X is 
non-empty). 


Zorn’s Lemma. Every inductive partially ordered set admits a maximal 
element. 


Zermelo’s Theorem (Wohlordnungssatz). Everysetcan be well-ordered. 


1.8. Cardinal Numbers 


We now assume, until the end of the penultimate chapter, that the Ax- 
iom of Choice holds. 


Definition. An ordinal is a cardinal ifit isnot equinumerous to a smaller 
ordinal. 


Example 1.8.1. 


(1) Any finite ordinal is a cardinal. 

(2) The ordinal w is a cardinal. When considered as a cardinal it 
will be denoted by No. 

(3) If a is an infinite ordinal, then at is not a cardinal. (Indeed, 
a* and @ are equinumerous.) 


Proposition 1.8.2. Any set X is equinumerous to a unique cardinal, de- 
noted by card(X). 
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Proof. By Zermelo’s Theorem and Theorem X is equinumerous 
to an ordinal a. Let 6 < a be minimal such that 6 is equinumerous to a. 
Then £ is a cardinal and is in bijection with X. Uniqueness is clear. 


Proposition 1.8.3. Let X and Y be sets and assume that X is non-empty. 
The following statements are equivalent: 


(1) card(X) < card(Y). 
(2) There exists an injective map X > Y. 
(3) There exists a surjective map Y > X. 


Proof. (1)=>(2) is easy. 

(2)>(3): Let f : X > Y be an injective map. As X is non-empty, 
one may fix x9 € X. One defines a surjective map g : Y > X by set- 
ting g(y) = Xo ify € im(f) = {f(x) | x © X}, and g(y) := f-'G) 
otherwise. 

(3)=>(1): If there exists a surjective map Y — X, then there exists a 
surjection g : A = card(Y) > « = card(X). The map f sending a € x 
to the minimal 6 € A such that g(6) = a provides an injection x > 2. 
In particular, « is in bijection with some ordinal y < A. (One takes y 
as the unique ordinal which is isomorphic to the well-order induced on 
im(f).) 
Definition. A set X is said to be countable if card(X) < No, and finite if 
card(X) < No. 


Proposition 1.8.4. Let X be a set of cardinals. Then A = sup,.y k is a 
cardinal. 


Proof. If a < A, then a < x for some x € X. Since x is a cardinal, we 
have x = card(x) < card(A), and hence @ < card(A). This proves that 2 
is not equinumerous to some smaller ordinal. 


Notation. From now on, x, A, etc. will denote cardinals. 


There is no largest cardinal. Indeed, if x is a cardinal, then” := 
card(P(x)) > « by Cantor’s Theorem. In particular, the set of all cardi- 
nals < A that are > xis non-empty. We denote by x* its smallest element, 
called the cardinal successor of x. To avoid confusion, from now on the 
ordinal successor of a will be denoted by a + 1. 
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Definition. The X-hierarchy assigns to any ordinal a cardinal as fol- 
lows: 


° No 2= 0. 
© Rati t= Ng. 


°N I= SUP g ca Ng, if a is a limit ordinal. 


By transfinite induction, one proves that a < B > Ng < Ng. Incom- 
bination with the next result, it follows that the X-hierarchy provides a 
strictly increasing enumeration of the infinite cardinals by the ordinals. 


Proposition 1.8.5. Every infinite cardinal is of the form X, for some a. 


Proof. Let x be an infinite cardinal. The function 6 +> Nz is strictly 
increasing on x + 1, and it takes its values in &,4,;. Thus &, > « by 
Lemma and hence &,4; > «x. Leta < «+1 be minimal with 
XN, > «. Since x > No, we have a > 0. If a were a limit ordinal, by 
definition we would have x € U oe Ne, and hence x € Ne for some 
B < a, which would contradict the minimality of a. Thus a = 6 + 1 and 
also Ng < x < Nei = NZ. Since NF is the cardinal successor of Ng, 
necessarily Ng = x. 


1.9. Operations on Cardinals 


If X and Y are sets, one denotes by X + Y their disjoint union, by X x Y 
their cartesian product and by X* the set of maps from Y to X. If x and 
A are cardinals, one denotes by x +d the cardinal of their disjoint union, 
by xA the cardinal of their cartesian product and by x? the cardinal of the 
set of maps from 4 to x. These operations are respectively called cardinal 
addition, cardinal multiplication and cardinal exponentiation. 


They should not be confused with the corresponding ordinal oper- 
ations. For instance 2° = w = No < 2%0; also No2 = No, but w < w2. It 
is clear that on finite cardinals all of these operations correspond to the 
usual arithmetic operations. Note that card(x + Y) = card(X)+card(Y), 
card(X x Y) = card(X)card(Y) and card(X¥) = card(x)ca™d@), 

The proof of the following statements is immediate, using Proposi- 
tion 1.8.3). 
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Proposition 1.9.1. Let x,A and yu be cardinals. 


(1) Cardinal addition and multiplication are commutative and as- 
sociative, multiplication is distributive with respect to addition, 
At = ai, (x2) = x4 and (KA)K = KHAK. 

(2) Ifx <A, thenx +p <A+pm Ku < Auand k* < AK (when xk > 0) 
and u™ < 4 (when u > 0). 


Proposition 1.9.2. One has card(R) = 2%, 


Proof. There is an injection h : 28° — R sending a sequence (aj)jcn 
to the sum >), a,2~' if the support of the sequence is infinite, and to 2 + 
br a,;2~' otherwise. This proves that 2%o < card(R). On the other hand, 
the image of h contains the interval (0, 1) which is equinumerous to R 
(for instance via x + 1/7 arctan(x) + 1/2); hence card(R) < 2®. 


Proposition 1.9.3 (Hessenberg’s Theorem). For every infinite cardinal 
K, one has KK = x. 


Proof. By induction on a, we will prove that NyXq = Ng. 
For a = 0 this is clear. Indeed, the mapping a, : N* > N defined 
by a2(m,n) := 1/20m + n+ 1)(m +n) + nis bijective. 


Let us now assume NgNg = Ng for every B < a. One endows NX, x 
NX, with the following order: 


(8,7) < (B', y’) if max(B, y) < max(6’, y’), or 
if max(6, y) = max(f’, y’) and 6 < f’, or 
if max(6, y) = max(f’,y’), 6 = 6’ andy <7’. 
One checks easily that this is a well-order. Furthermore, for every 6 < 


Ng, the set 6 x 6 is an initial segment for <. By Theorem there is a 
unique isomorphism of ordered sets f : € > Ng X Ng with ¢ an ordinal. 


Assume € > Ny. Then Ny € cand f(Xqz) = (Bo, %) € Ng X Ng. Set 
6p := max(fo,%) + 1. Since no infinite successor ordinal is a cardinal 
(by Example [I.8.1)), we have 6) < XN, and the restriction of f to Ng is an 
injective map from Ng to dg X do, a set of cardinality 


card(d9 X dp) = card(d9) < dg < Ng 
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by the induction hypothesis. This is a contradiction, and thus one has 
Na Xa <Ra- 
The inequality in the other direction is clear. 


Example 1.9.4. Let J be the set ofall open subsets of R. Then card(7) = 
2°, 


Proof. The mapping assigning to a real number r € R the open interval 
(r, +00) defines an injection from R to J, which proves that card(7) > 
2°, 

Conversely, note that every open subset of R is a union of intervals 
of the form (q,q + q’), with q € Q and q’ € Q,y. The mapping sending 
YCQxQ,, to Ucaaner q+’) provides a surjection of P(Q x Q,9) 
to J. Since Q x Q,y is countable, one deduces that 2*° > card(J). 


Proposition 1.9.5. 


(1) Let X and Y be non-empty sets and assume that at least one of 
them is infinite. Then 


card(X U Y) = card(X x Y) = max(card(X), card(Y)). 
(2) Letx > Ny anda > 0 becardinals. Thenxn +A = KA = 
max(x, A). 
(3) Let (X;)ier be a family of sets with at least one X; infinite. Then 
(*) card (U x) < sup ({card(X;) | i € I} u {card()}). 
iel 
(In particular, a countable union of countable sets is countable.) 
If furthermore the sets X; are all non-empty and mutually dis- 
joint, then equality holds in (4). 
Proof. (1) Let x = max(card(X), card(Y)). We have 
x <card(X UY) <xK+x=2« < xx 


and x < card(X x Y) < xx. One concludes by Hessenberg’s Theorem. 


(2) is a special case of (1). 
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(3) Let X = {(x;, i) | x; € X; for some i € I} be the disjoint union of 
the sets X;. There is a canonical surjection X > J,_, X;, hence it suffices 
to prove that 


card(X) < sup ({card(X;) | i € I} u {card()}). 


ieI 


Let x = sup{card(X;) | i € I}, and let Y; be the set of injective maps 
X; — «. Since the sets Y; are all non-empty, by the Axiom of Choice 
there exists some f = (fi)ier € Iie Y;. Consider g : X > «x1, defined 
by g((%j,i)) := (fj), i). The function g is injective, hence card(X) < 
x card) = max(x, card(J)). The equality statement is clear. 


It follows from the preceding proposition that cardinal addition and 
multiplication is quite trivial for infinite cardinals. The situation for car- 
dinal exponentiation is very different. In fact, the ZFC axioms are far 
from completely determining the values of cardinal exponentiation. For 
instance they do not allow to settle the continuum hypothesis: 


Definition. 


* The Continuum Hypothesis (CH) is the statement 2X0 = Xj. 


* The Generalized Continuum Hypothesis (GCH) is the state- 
ment 2” = x* for every infinite cardinal x. 


If (x;)jer is a family of cardinals, we shall denote by >’, «; the car- 
dinal of the disjoint union of the x;, and by |],_, «; the cardinal of the 
product of the family. 


Theorem 1.9.6 (K6nig’s Theorem). Let (x;)jez and (A;)ier be families of 
cardinals with x; < A; for every i. Then 31.7% < [Lje,Ai- 


Proof. Let f : Di-7*%i > []j,_,4i- For every i, f induces a mapping 
fi: % — A; given by the ith component of the restriction of f to xj. 
Since x; < A;, the set B; := A; \ im(f;) is non-empty for every i. By (AC) 
there exists some b € [J ,_, Bj © [],-,4i- Clearly b ¢ im(f). 


1.10. Cofinality 


In this section we shall use the notion of cofinality to prove for instance 
that 280 4 X,,. 
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Definition. 


« Let X be a totally ordered set. We say that a subset Y C X is 
cofinal in X if Y is not bounded in X, that is, if for any x € X 
there exists y € Y such that x < y. We say that a function 
f : Z—-X is cofinal if im(f) is cofinal in X. 


* The cofinality of an ordinal a, denoted by cof(q), is the smallest 
ordinal 6 such that there exists a cofinal function 6 > a. 

Example 1.10.1. 

(1) cof(0) = 0. 

(2) cof(a + 1) = 1 for any ordinal a. 

(3) cof(w) = w. 
Proposition 1.10.2. Let a be an ordinal. 

(1) cof(a) <a. 

(2) cof(a) is a cardinal. 


(3) cof(«) is the smallest ordinal 6 such that there exists a cofinal 
and strictly increasing map B > a. 


(4) cof(cof(a)) = cof(a). 


Proof. (1) is clear, and (2) follows from the fact that any ordinal £ is in 
bijection with card() < f. 

(3) It is enough to provide some f < cof(@) and acofinal and strictly 
increasing map 6 — a. By hypothesis, there exists a cofinal maph : 
cof(a) > a. Let us define 


X = {x €cof(a) | h(y) < h(x) for every y < x}. 


The set h(X) = {h(x) | x € X} is cofinal in a. Indeed, let y < a. By 
the cofinality of h, there exists y € cof(a) such that h(y) > y. When y is 
minimal with this property, we have y € X. 

Since (X, <) & (8, €) for some 6 < cof(a), we are done, because the 
restriction of h to X is cofinal and strictly increasing. 

(4) cof(cof(a)) < cof(«) follows from part (1). For the inequality 
in the other direction, let us consider the cofinal and strictly increasing 
functions f : cof(cof(a)) — cof(~) and g : cof(a) > a, which are 


Licensed to Mathematisches Forschungsinstitut. Prepared on Thu Sep 30 04:18:29 EDT 2021 for download from IP 188.1.230.74. 
License or copyright restrictions may apply to redistribution; see https://www.ams.org/publications/ebooks/terms 


1.10. Cofinality 21 


possible by (3). Then the function go f : cof(cof(a)) > a is cofinal, and 
hence cof(a) < cof(cof(a)). 


We shall say that an infinite cardinal x is regular if cof(x) = x, and 
singular if cof(x) < x. 


Proposition 1.10.3. Any infinite cardinal which is a successor is regular. 
In particular &, is regular. 


Proof. Let x = Ne+1 = Nee Note that for a limit ordinal a, a subset 
X C ais cofinal if and only ifa = U, ex Y. (This follows from Propo- 
sition |1.5.5.) Consider a function f : A > x for some’ < x. Then 
A < Xg and it follows from Proposition [1.9.5(3) that card (U Bea i (6)) < 
sup ({card (f(8)) |B < A}U {A}) < Ng. Hence f is not cofinal. 


Proposition 1.10.4. [fA is a limit ordinal, then cof(&®,) = cof(A). 


Proof. If f : « > Ais cofinal, then f : a > Xj, Br X (gy is cofinal 
too, since Ny, = eg N, by definition. This proves cof(®,) < cof(A). 
Conversely, let g : a > Ny be cofinal. The map § : a — A, defined by 
&(8) = 0 if g(8) is finite, and g(8) = y if card(g(8)) = &,, is cofinal. 


Proposition 1.10.5. Let x > 2and A > No be cardinals. Then cof (x*) > 
A. 


Proof. Consider a map f : a > x+, with w some ordinal < 4. Since 
f(B) < x? for every B < a, it follows from Kénig’s Theorem that 


card (LU 7) < ¥\ card(f(8)) < T] (x4) = 4288 < xt. 


B<a B<a B<a 


Hence f is not cofinal. 


Corollary 1.10.6. 2% # X,,. 


Proof. We have cof(X,,) = cof(w@) = w = No < cof(2*°). 


Licensed to Mathematisches Forschungsinstitut. Prepared on Thu Sep 30 04:18:29 EDT 2021for download from IP 188.1.230.74. 
License or copyright restrictions may apply to redistribution; see https://www.ams.org/publications/ebooks/terms 


22 1. Counting to Infinity 


1.11. Exercises 


Exercise 1.11.1. 


(1) Prove that an ordinal a isa limit if and only if there is an ordinal 
6 4 Osuch that a = wf. 


(2) Prove that w? = ww is not of the form 6 + a. 
Exercise 1.11.2 (Cantor normal form). Let @ be an ordinal > 1. 


(1) Prove that a” > y for any ordinal y. (Is there any example with 
ay’ = y?) 

(2) Let 6 be an ordinal > 0. Prove that there exists an ordinal y 
such that a’ < B <a”. 


(3) Deduce that any ordinal 6 can be expanded in basis a: there ex- 
ists a finite sequence of ordinals 6, > ... > 6, > 0 and ordinals 
k; with 0 < k; < a such that 


B= oFtk, +... 4+ cPnk,. 


Furthermore the natural number n and the sequences (§;) and 
(k;) are unique. 

The expansion into basis w is called the Cantor normal 
form. 


Exercise 1.11.3 (Goodstein sequences). Let n, p be natural numbers, 
with p > 2. Let us define the iterated expansion of n in basis p as follows: 
first expand n in basis p (for instance ifn = 35, p = 2, one has 35 = 
2° +2 +1). Then, expand the exponents in basis p and so on, so that all 
numbers occurring are < p. For instance the iterated expansion of 35 in 
basis 2 is 35 = 27414241. 


For q = p = 2, one defines a function f,_ : N > N as follows. Let 
n be a natural number. Then f, (1) is obtained by replacing all occur- 
rences of p in the iterated expansion of n in basis p by q. One similarly 
defines ordinal valued functions f, .. by replacing all occurrences of p by 
w (when p > 2, one writes the coefficients at the right of the p”). For in- 
stance, f,3(35) = 3? 414341 = 59053 and fy, .,(35) = fyy(33-+3-2+2) = 
wo? +o0-242. 


(1) Prove that fo, ° fog = Jp, for anyw>r>q> p22. 
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(2) Prove that the functions f,., are strictly increasing. 


For any a EN, the Goodstein sequence attached to a is the sequence 
(8n(@))n>2 defined by g,(a) := a and 


= San+i1(8n(a)) =u if g,,(a) # 0, 
8n41(@) = : 
0 otherwise. 
For a = 5, one has for instance g>(5) = 5, g3(5) = 33 +1—1 = 27, 
g4(5) = 44 —1 = 255, and g5(5) = 5° -3+57-34+5-3+3-1= 467. 


(3) Let a be a natural number. Study the monotonicity of the se- 
quence Jp.o(8p(@))- 

(4) Prove that for any a € N there exists s(a) € N such that g,,(a) = 
0 when n > s(a). 


Exercise 1.11.4 (The order topology). On any totally ordered set X, one 
may define a topology, the order topology, generated by the open inter- 
vals, that is, by the sets of the form (—oo,b) = {x € X | x < 5b}, 
(a,b) ={x EX | a< x <b}or(a,o), fora,b EX. 


(1) Prove that for any X, the order topology is Hausdorff. 


(2) Let wand £ be ordinals endowed with the order topology. Prove 
the following statements: 
(a) a is discrete if and only ifa < w. 
(b) aw is compact if and only if « is not a limit ordinal. 
(c) Suppose f : a > 6 is a weakly increasing function, that 
is,x <y=> f(x) < f(). Then f is continuous if and only 
if, for any limit A € a, f(A) = SUP, <3 f™. 


Exercise 1.11.5 (Ulam’s Theorem). An Ulam matrix is a family of sub- 
sets Uz, of X,,@ < Nj, n < w, such that 
- foranyn < wanda, < &, witha # 8, Uzn OUgn = 9; 


+ for any a < Nj, the set ®; \ U,_.. Ug.n is at most countable. 


n<o 


(1) For any § < Xj, let fe : @ > X, such that € C im(fz). Set 
Ua,n i {E < Ni | fe(n) = a}. 


Prove that (U,,,) isan Ulam matrix. 
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(2) Let uw : P(X) > [0,1] be a o-additive measure on Nj, that is, 
MU ney An) = Une M(An) for all families (Ap nc of pairwise 
disjoint subsets of &,. Prove that if u({a}) = 0 for every a < Xj, 
then yu is identically zero. 


Exercise 1.11.6 (Closed unbounded sets). Let x > Ng bea regular cardi- 
nal. A set C C x is said to be a club (closed unbounded) if it is closed and 
cofinal, that is, if it is cofinal in x and for any non-empty subset A C C, 
supA € CU {x}. A subset S C x is called stationary if Sa C # @ for any 
club C Cx. 


(1) Let A < x and (Cj); <, be a family of clubs. Prove that ();_, C; 
is a club. 


(2) Let (C;);-, be a family of clubs. Prove that the diagonal inter- 
section Aj.,C; = {a<x|a €(),_, Ci} is a club. 


(3) (Fodor’s Lemma) Let S C x be a stationary set and suppose 
f : S > x isa regressive map (that is, such that f(a) < a for 
any a € S). Prove that there exists a stationary set T C S such 
that f is constant on T. 


(4) (Solovay’s Theorem for &,) Let x = Nj, and let S C XN, bea 
stationary set. Prove that S can be written as the union of & 
pairwise disjoint stationary sets. 


[Hint: For any limit ordinal a € S, write w = sup,,_., a7. Use 
this to define a regressive function to which one may apply 
Fodor’s Lemma. | 


Exercise 1.11.7 (Solovay’s Theorem). Let x > Ng be a regular cardinal. 
The aim of this exercise is to prove Solovay’s Theorem: Any stationary 
set S C x can be written as the union of x pairwise disjoint stationary sets. 


(1) For a cardinal A < x, let EY = {a < x«|cof(a) = A}. Prove 
Solovay’s Theorem when S C Ef. 


(2) Prove Solovay’s Theorem when S C {a < x| cof(a) < a}. 


(3) Assume S is a set of regular cardinals. Prove that 
T ={a € S|S cis not stationary in a} 


is stationary. 
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[Hint: Prove it by contradiction after noticing that if C is a club, 
the set C’ C C of limits of points of C is still a club.] 


(4) Prove Solovay’s Theorem in the general case. 


[Hint: Prove it is enough to consider the case when S is a set 
of regular cardinals. For a € T, consider a strictly increasing 
sequence (ay : € < x) with limit a such that ay é S. Then 
adapt the proof of (1).] 


Exercise 1.11.8 (Filters and ultrafilters). Let X be a non-empty set. A 
filter on X is a set F C P(X) such that 


(i) XE F, O€F, 
(ii) if A,B € F, then ANB € F, and 
(iii) ifA € FandA C B,thenB € F. 


A set B of non-empty subsets of X is a filter basis on X if for any 
A,B € B there is C € B such that C CANB. 


(1) Prove that any filter basis B is contained in a minimal filter 
with respect to inclusion, denoted by Fg. 


(2) Let J be a set and let I be the set of finite subsets of J. For any 
i€ I,setI; = {j € I | i C j}. Prove that the set B whose 
elements are the sets J;, i € I, is a filter basis on I. 


(3) Prove that when X is infinite, the set of subsets of X whose 
complement in X is finite is a filter. This filter is called the 
Fréchet filter. 


(4) An ultrafilter is a filter which is maximal with respect to inclu- 
sion. Prove that a filter F on a non-empty set X is an ultrafilter 
if and only if for any A C X, either A or X \ A belongs to F. 


(5) Prove that for any x € X, the set U, = {Y CX |x € Y}isan 
ultrafilter on X. Such an ultrafilter is called principal. 


(6) Prove that an ultrafilter is principal if and only if it contains a 
finite set as an element. Deduce that when X is finite, any ul- 
trafilter on X is principal, and when X is infinite, an ultrafilter 
U is non-principal if and only if it contains the Fréchet filter as 
a subset. 
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(7) Prove that any filter is contained in some ultrafilter. In partic- 
ular, on any infinite set there exists a non-principal ultrafilter. 


Exercise 1.11.9 (Hausdorffs Theorem). The aim of this exercise is to 
prove the following result due to Hausdorff: On an infinite set X of car- 
dinality x, there exist 22") distinct ultrafilters. 


(1) A family F of subsets of X is said to be free if whenever Aj, ..., 
Ay, By, ..., Bm are distinct elements of F, then Ay N---N A, AN 
(X \ B,) N--- A(X \ B,,) is non-empty. Prove that Hausdorff 
Theorem is a consequence of the following statement: there 
exists a free family of cardinality 2”. 


(2) Consider the set Y consisting of pairs (F, (A, ..., P,)) with F a 
finite subset of X and (A, ..., B,) a finite sequence of of subsets 
of F. What is the cardinality of Y? 


(3) To any A € P(X) we assign a subset A’ of X UY as follows: 
* ifx eX, then x € A’ if and only if x € A; 
- ifye Y andy =(F,(#....,B,)), then y € A’ if and only if 
An Fis one of the P’s. 
Prove that the set of the A’ for A C X isa free family (on XUY). 


(4) Conclude Hausdorffs Theorem. 


Exercise 1.11.10 (The continuum hypothesis for closed subsets of R). 
The aim of this exercise is to prove that the continuum hypothesis holds 
for closed subsets of the real line (Cantor’s Theorem), that is, if F C R is 
closed, then either F is countable or card(F) = 2°. 


Let F C R be aclosed subset such that card(F) > No. 


(1) Let C be the set of x € F such that there is an open set Q C R 
which contains x and satisfies card(Q NF) < No. Set F’ := 
F\C. 

Prove the following properties: 
(a) C is countable, 
(b) F’ is aclosed subset of R, and 
(c) any non-empty open subset of F’ is uncountable. 


(2) Let I be an open interval such that I nF’ # @. 
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Prove that for any ¢ > 0 there are open intervals I) and I, 
of length at most ¢ such that I; nN F’ # @ and JI; C I fori = 0,1 
and such that Ip NI, = 9. (Here, J denotes the closure of J.) 


(3) Prove that there is an injection of 2*° into F’. 


(4) Prove Cantor’s Theorem. 


Exercise 1.11.11. A tree is a partial order (T, <7) such that for every 
t € T, the set 


f:={seET|s<_t} 


is well-ordered. If T has a smallest element, it is called the root of T. 


The height of t € T is defined as the unique ordinal isomorphic to 
(f, <7) and denoted by ht(f). Finally, ht(T) := sup{ht(t)+1|t € This 
the height of T. For an ordinal @ one sets T(a) = {t € T | ht(t) = a}. 

A branch in a tree (T, <r) is a totally ordered subset which is max- 
imal with respect to inclusion. It is called cofinal in T if it non-trivially 
intersects any level T(a) for a < ht(T). 


Prove Konig’s Lemma: If T is a tree of height w such that T(n) is 
finite for every n € w, then T contains a cofinal branch. 


Exercise 1.11.12. 


(1) For a € NX, let Yq be the set of strictly increasing non-cofinal 
maps from @ to Q. For 6 < a, denote by PB : Yq > Yg the (sur- 
jective) restriction map. Prove that there are countable subsets 
Xq © Y, such that for any a > 6, the following properties hold: 

(a) P3(Xq) = Xp. 

(b) Forany f € Xg, any upper bound q € Q of im(f) and any 
€ > O there is g € X, with p3(8) = f such that im(g) is 
bounded above by q + «. 


(2) Let 1 X,, be the set of all f = (fy) € ITaer, X,, such that 
a — 7 _— 
P3(fa) = fg for all 8 < a. Prove that LM Za Xy = 9. 
(3) Deduce Aronszajn’s Theorem: There is an Aronszajn tree, that 


is, a tree T of height &, with T(a) countable for all a < &, such 
that T does not admit a cofinal branch. 
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1.12. Appendix: Hindman’s Theorem 


In this appendix we shall freely use the material on ultrafilters treated in 
Exercise [I.11.8, Let E be a non-empty set and U(E) the set of all ultra- 
filters on E. When BE is infinite, there exist non-principal ultrafilters on 
E. 


For a subset A C E, set (A) = {U € U(E) | A € U}. 


Lemma 1.12.1. The sets (A), for 6 # A C E, form a basis of open sets for 
a compact Hausdorff topology on U(E). 


Proof. We have (A,) N --- N(A,) = (Ay N-+: NA,), which proves that 
the sets (A) form a basis of open sets for some topology. Furthermore, 
(E \ A) = U(E) \ (A), thus (A) is open and closed. In particular, the 
topology is Hausdorff. 

Consider a covering LJ er i Of U(E) by open subsets. To prove the 
existence of Ig C I finite such that Viet Q; = U(E) we may assume 
QO; = (A;) for any i € I. If for some finite IJ C I one has Vier, A; = E, 
we are done, since then U; er, (A;) = U(E). Otherwise, the set 


B=4()|E\A;| Io CI finite 


ieIg 
is a filter basis and is contained in some ultrafilter U on E by Exercise 
1.11.8, which contradicts (),_, Q; = U(E). 
Now consider U(N*). Let k € N*, A C N* and U € U(N*). One sets 
A-k:=N*n{a-—k|aeA}and 
Ay :={k EN*|A-—k Ee U}. 


Lemma 1.12.2. For any U € U(N*) and every A, B C N*, one has Nz, = 
N*, (A Nn B)y = Ay Nn By and (N* \A)y = N* \ Av. 


Proof. These properties are easy consequences of the fact that for k € 
N* one has N* —k = N*, (A—k)an(B—k) = ANB—kand (N*\A)-—k = 
N* \ (A—k). The details are left as an exercise. 


For U,V € U(N*), one sets U ®V := {A CN* | Ay € VI. 
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Lemma 1.12.3. 
(1) Forany U,V € U(N*), U@V € U(N*). 
(2) @ is associative. 
(3) For fixed U, the function fy : U(N*) > U(N*), Vb U OV, is 


continuous. 


Proof. (1) It is clear that N* € U @ V, and that for any A € U @ V and 
A C Bone has B € U@V. Stability of U @ V under intersection and 
the fact that A € U@ V if and only ifN* \A € U@ V are consequences 
of Lemma 


(2) One has (A — k)y = Ay — k (exercise). From this, one may 
deduce that (Ay)y = Ayer. Indeed, for 1 € N*, one has 


le(Ay)y eo Ay -LEVSA-Dy EV 
eA-lEUGVS1EAuer. 
If U,V, W © U(N*), we deduce that 
AEUGVOW)SAyEVOW 
© (Ay)y =Avav EW SACU OV EW, 
which proves associativity of @. 
(3) Let. A C N*. Continuity of ft; follows from the following: 
fa’ WA) ={V € UN) |A CU BV} 
= {V € U(N*) | Ay € V} = (Ay). 


We shall say an ultrafilter U on N* is idempotent if U @ U = U. 
Proposition 1.12.4. There exists an idempotent ultrafilter in U(N*). 


Remark 1.12.5. For m,n € N*, one can easily check that the sum of 
the corresponding principal ultrafilters is given by U,, ® U, = Uns+n. In 
particular no principal ultrafilter is idempotent. 


Proof of Proposition One considers the set 
A :={A C U(N*) | A is a non-empty closed set with A @ A C A}. 


The set A is non-empty since it contains U£(N*). Furthermore the partial 
order on A given by 2 is inductive. This follows from the fact that a 
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subset of a compact Hausdorff space is closed if and only if it is compact. 
Thus, if (A;)jey is a totally ordered subset of A, then ae epi € A. By 
Zorn’s Lemma, there exists B € A which is minimal for inclusion. 


Let U € B. We shall prove that U @ U = U. 


First, we consider B’ := U@B C B. Since fy is continuous, the set 
B’' = fy(B) is compact (hence closed). Furthermore, if ¥,¥% © B, one 
has(U@VY)@(UG\) = UMW forsome W € B, since BOB C B. This 
proves that B’ @ B’ C B’. Hence we have B’ € A, and thus B’ = B by 
minimality of B. In particular, there exists V’ € Bsuch that U@V’ = U. 

Set V' = {V'’ € B| U@V’' = U}. By continuity of the map fy, 
Vv’ = fg'(U}) n B is closed, and we just proved it is non-empty. If 
V',V" € V’, then U @(V’@V") =(U@V)@V" =UOV" =U. 
Thus V’ is closed under © and it belongs to A. It follows that V’ = B by 
minimality of B. In particular, U € V’, hence U ®U =U. 


For a subset A C N*, one denotes by 


Eq i=} >) k|Ap CA finite 
kEAg 


the set of finite sums of distinct elements of A. 


Theorem 1.12.6 (Hindman’s Theorem). Let y : N* > {]1,...,n} be any 
function. Then there exists B C N* infinite such that y is constant on Xp. 


Proof. Let U = U @U € U(N*) be an idempotent ultrafilter as pro- 
vided by Proposition [1.12.4 There then exists a unique ig < n such that 
x (io) € U. SetA := 77 "(ig). We will prove the existence of an infinite 
set B C A such that Xp C A, which will complete the proof. 


As U =U ®U, for any C € Uone has Cy € U, hence Cn Cy € U. 
In particular, CN Cy is infinite. Furthermore, for any k € CN Cy one 
has(C—k)NC EU. 


By induction on n € N, one defines a decreasing sequence (A”),cn) 
in U and a strictly increasing sequence of integers (K,),ey aS follows. 
One sets A° := A and one chooses ky € A° 1 Af. Assuming A” and 
k,, already constructed, one sets Atl := (A"—k,)n A" € U, and one 
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chooses some k,,4; > k, such that k,,; € A?*+! n AU". This is possible 
since A+! q AZ? is infinite. 

Now B = {k,, | n € N}is an infinite subset of A. Let I be a finite non- 
empty subset of N. We now prove by induction on card(J) that 97,7, ki € 
Amin! Tf J is a singleton, this is clear. Assume now that card(I) = m+1 
for some m > 0 and let ig < ... < i,, be the elements of J. By induction 
we know that 


ki, et Kin € Ab G Alo+1 = Alo fa) (Alo _- Ki, )- 


It follows that kj, +..+kj,, © A’o as claimed. We have thus proved that 
dp CA. 
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Chapter 2 


First-order Logic 


Introduction 


This chapter introduces the basics of first-order logic. First-order logic is 
a standard way to formalize mathematics. For instance Peano arithmetic 
and Zermelo-Fraenkel set theory formalize respectively number theory 
and set theory. In a given formal language, one can define the basic no- 
tions of variables, terms and formulas. One can also define formal logical 
deduction rules that allow one to deduce new statements from a given 
set of axioms via a formal proof. This corresponds to the syntactic side 
of formal logic, where constructions are purely symbolic and no specific 
interpretation or meaning is given to the symbols. 


On the other hand, structures for a given language are sets where it 
is possible to interpret terms and formulas. In particular, one can give a 
meaning to the validity of a formula with no free variables in a structure. 
This is the semantic side. 


The main result in this chapter is Gédel’s Completeness Theorem 
which states that a formula with no free variables can be formally de- 
duced from a given set of axioms if and only if it is valid in every structure 
satisfying these axioms. This particularly remarkable result shows that 
the syntactic and semantic viewpoints are equivalent. It also provides 
an a posteriori justification for our choice of syntax. 
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2.1. Languages and Structures 


Statements in first-order logic are sequences of symbols describing prop- 
erties of certain structures. For instance the statement given by 


Vx(x >0 > dyy-y=x) 


is satisfied in some ordered field if and only if every positive element is 
a square. It holds in the ordered field of real numbers, that is, in the 
structure ® =(R;0, 1, +, —, -, <), and it does not hold in the ordered field 
of rational numbers. 


We will see that this statement can be expressed in first-order logic. 
However, other properties of ordered field, like being complete (any 
bounded non-empty subset admits a supremum) or archimedean (for 
every x > 0 there exists n € N such that nx > 1), cannot be expressed 
as statements in first-order logic. 


Definition. A (jirst-order) language is a set of symbols £ composed of 
two disjoint subsets: 


(1) The first part (common to all languages) consists of auxiliary 
symbols “(” and “)” together with the following logical symbols: 
the set of variables V={v, | nEN} 
the equality symbol = (“equals”) 
connectives (negation, “not”), and 
A (conjunction, “and”) 
the existential quantifier 4 (“there exists”) 


J 


(2) The second part, called the signature of £ and denoted by o*, 
consists of the non-logical symbols of £. It consists of 
* aset of constant symbols C*; 
* a sequence of sets Fe, n € N*, where elements of F* are 
called n-ary function symbols; 
* a sequence of sets RE, n € N*, where elements of R¥ are 
called n-ary relation symbols (or n-ary predicates). 


The language £ is given by the (disjoint) union of these sets of symbols. 
Let us note that a language is always infinite. The first part being 


common to all languages, we shall make the abuse of notation of identi- 
fying £ and o%. 
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Example 2.1.1. 


Ly =O The empty language. 

Lring = (0,1, +,-,°} The ring language (with 1). 
Lord = {<} The order language. 

Loring = Fring U Lord The ordered ring language. 
Lar = {0,S,+,-,<} The language of arithmetic. 
Eset = {E} The language of set theory. 


In these examples, 0 and 1 are constant symbols, — and S unary 
function symbols, + and - binary function symbols, and < and € binary 
relation symbols. 


Definition. Let £ be a first-order language. An £-structure % consists 
of a non-empty set A (called the base set of 2) together with an element 
c™ € A for eachc € @*, a function f™ : A” > A foreach f € Ff anda 
subset R™ C A” for each R € RX. We write A = (A; (Z")zegc). 


Z" is called the interpretation of the symbol Z € o% in the structure 
A. 


Example 2.1.2. 

(1) N = (N;0,S,+,-,<) is an Lg,-structure, with S the successor 
function assigning x + 1 to x, and the other symbols having 
their usual interpretation. 

(2) © =(C;0,1,+,-, -), the field of complex numbers, is an fring” 
structure. 


(3) R = (R;0,1, +, —,-, <), the ordered field of real numbers, is an 
£o.ringStructure. 


Definition. We say that two £-structures & and % are isomorphic, 2% = 
%, if there exists an isomorphism F : & = %&, that is, a bijection F : 
A — B between the base sets of 2% and 8 which commutes with the 
interpretations of the symbols in o*, that is, 


» F(c") = c® for every constant symbol c € @*, 

© F(f™ (ay, An)) = f3(F (a), »., F(a,)) for every function sym- 
bol f € Ff and every tuple (qj,...,dn) € A”, 

* (Qy,..,4n) € R® <> (F(a,),...,F(a,)) € R® for every predi- 
cate R € RX and every tuple (aj,...,d,) € A”. 
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2.2. Terms and Formulas 


A word over a set (alphabet) E is a finite string w = aga, --- a,_; with 
a; € E for every i. We call k the length of w, and we denote by E”* the set 
of words over E.8 


Definition. Let £ be a language. The set 7“ of £-terms is the smallest 
subset D of £* containing the variables and the constant symbols such 
that if f € F£ and ty,...,t, € D, then ft, ---t, € D. 


ee T, where T° = C* u V* and, 


It is easy to see that T* = UJ 
inductively, 


Fo = 5 Ulf t, | REN fF CR? and twit EF}. 


Proposition 2.2.1 (Unique reading of terms). Any termt € J% satisfies 
one and only one of the following: 

(1) t is a variable. 

(2) t is a constant symbol. 


(3) There exists a unique integer n > 1, a unique n-ary function 
symbol f and a unique sequence (t,...,t,) of terms such that 


t= fty---ty. 


Proof. Exercise. (One first proves by induction on the length of terms 
that no proper initial segment of a term is a term.) 


Notation. For ease of reading, from now on we shall often write 
fGh,.--,t,) instead of ft, ---t,. When f is binary, we sometimes write 
t, ftz instead of ft,t,. For instance, (x + y)- Z means - + xyz. 


The height ht(t) of a term t is defined as the least natural number 
k such thatt € 4%“. It follows from the unique reading property for 
terms that ht(f(t, ---t,)) = 1+ max(ht(t;)). This will allow us to give 
definitions by induction on the height of terms. 


Definition. An atomic £-formula is 


¢ either a word of the form t, = t,, where t,t, are L-terms, 


We will continue to write N* for the set of non-zero natural numbers. As we will never use N as 
an alphabet, this should not cause any ambiguity. 
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* oraword of the form Rt, -- t,, where R € R% and all the ¢; are 
£-terms. 


The set Fml” of £-formulas is the smallest subset D of £* that contains 
all atomic £-formulas and such that ifx € V and g,wp € D, then 79, 
(pA p) and Ax¢ are all in D. 


It is easy to see that Fml* = U Fml;,, where Fmlo is the set of 


atomic formulas, and, inductively, 


nen 


Fmlny1 = Fmly U{-@ | 9 € Fm, FUL AYP) | 9p € Fmly} 
U{axg | x € Vand g € Fml|;}. 

Proposition 2.2.2 (Unique reading of formulas). Any £-formula ¢ sat- 
isfies one and only one of the following: 

(1) gis an atomic formula. 

(2) pis equal to =~) for some unique £-formula wp. 

(3) gis equal to (p A x) for some unique L-formulas yp and y. 

(4) ¢ is equal to Ax for some unique variable x and some unique 


£-formula wp. 


Proof. Exercise. (One first proves by induction on the length of formu- 
las that no proper initial segment of a formula is a formula.) 


If y is a formula, its height ht(¢) is defined as the least natural num- 
ber k such that g € Fmlk. It follows from the unique reading property 
for formulas that ht((g A p)) = 1+ max(ht(¢), ht(@)) and ht(Axgy) = 
ht(-g) = 1+ ht(v). This will allow us to give definitions by induction 
on the height of formulas. 


Definition. 


(1) Let vy, be a variable. One defines, by induction on the height 
of a formula 9, the notion of a free occurrence of U, in ¢: 
¢ If g is atomic, all occurrences of vu; in ¢ are free. 
¢ If pis equal to 72, the free occurences of v; in g are those 
in y. 
* If gis equal to (A y), the free occurrences of vu; in ¢ are 
those in ~ and those in y. 
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¢ If gis equal to dup and | # k, the free occurrences of u; 
in gy are those in yp. 
¢ If g is equal to 4u;,~, no occurrence of vu; in ¢ is free. 


(2) Occurrences of v;, in g which are not free are called bound. 


(3) The free variables of y are those having at least one free occur- 
rence in g. One denotes by Free(¢) as the set of free variables 
of ¢. 


(4) A sentence is a formula with no free variable. 


Example. If ¢ is the formula (Aug Ug < v1 A Up = Uj), the first two 
occurrences of Up are bound, and the third is free. All occurrences of v, 
in ¢ are free, thus Free(¢) = {Up, U;}. 


Notation. We will use the following abbreviations: 


(g V p) for "(7g A 7H) (disjunction) 
(g — pp) for =(@ A 7p) (implication) 
(eo p)for(g> PAW g)) — (equivalence) 
Vx for nAx7@ (universal quantifier). 


We shall write 4x},..., X, instead of 4x, --- 4x, (similarly for the uni- 
versal quantifier), R(t,,...,t,) instead of Rt, ---t,, and sometimes t,Rt, 
instead of Rt, ty. 


Weshall write (gj A---Ag,) or sometimes ae gy; instead of (--- ((@9A 
Q1) A Q2) A... A Gy), Similarly for V instead of A. 


Finally, to ease the reading of formulas, we shall sometimes add 
parentheses, or we will omit them. In this case, we shall read formulas 
according to the convention that symbols from {-,3,V} bind the strongest, 
followed by A, then v, and that symbols from {—, } bind the weakest. 


Thus, for instance Vxg A i > y shall mean ((Vxg A ~) > y), that 
is, -((Vx@ Ap) Ay), and so finally a((7Sx7¢g A ) A 7). 


Example. The field axioms can be expressed as first-order sentences in 
Syring' 

(1) Vxx+0=x. 

(2) Vx yx+y=ytx. 

(3) Vx(—x)+x =0. 
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(4) Vx, y,z(x+y)+2=x4+(y +2). 
(5) Vxx-1l=x. 

(6) Vx,yx-y=y-x. 

(7) Vx, y,2(x-y)-Z=x-(y-2Z). 

(8) Vx,y,z2x-(y+z)=(x-y)+(%-2Z). 
(9) Vx(>-x =0 > Ayx-y=1). 

(10) 70 = 1. 


2.3. Semantics 
We now explain how to interpret formulas in a given structure. 


Definition. Let 2% be an £-structure. 


(1) An assignment (with values in X) is afunction a : V > A from 
the set of variables to the base set of 2. 
(2) If wis an assignment and ¢ is an £-term, one defines ta], by 
induction on ht(t), in the following way: 
uv} [a] = a(v;) (for v; € V) and c™[a] = c™ (for c € C*), 


© f(tsstn) La] = failed, ..., tala). 
The proof of the following lemma is clear. 


Lemma 2.3.1. If two assignments a and £ coincide on allvariables having 
an occurrence in t, then t™[a] = t™[]. 


Notation. If t is a term, we shall sometimes denote it by t(xj,...,x,) if 
the variables x; are distinct and all variables having at least one occur- 
rence in ¢t belong to the x;. 


If a term f(x},...,X,) and elements aj,...,a, € A are given, one de- 
fines t™[ay,...,a,] by t™[a] with w an assignment with a(x;) = q; for all 
i, which is well defined by the previous lemma. 


Definition (Satisfaction of a formula). Let 2 be an £-structure. By in- 
duction on the height of a formula ¢, we now define for any assignment 
a the relation 2 F g[a] which will be read “¢ is satisfied in 2 by a”: 
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AE =t[a] = tila] = tla] 

AWE Rta] : —> (tf la],..., [a]) | R® 

AE ap[a] <=> AF yY[a] (that is, not WF p[c]) 
AEWA HI[a] => AE pla] and WE y[a] 


WE Axp[a] : => there exists a € A with & F p[aq/,]. 
Here, aq), denotes the assignment defined by aq,,(x) = a and 
Gayx(V) = ay) for y # x. 


Proposition 2.3.2. [f two assignments a and f coincide on Free(¢), then 
one has & F gla] ifand only if AE yf]. 


Proof. By induction on ht(gv). The case of atomic formulas follows from 
Lemma 2.3.1}. In the induction step, only the case where ¢ is equal to 
Axp deserves an argument. If 2 F gla], there exists a € A such that 
WE plag,]. Any variable y # x that is free in > is also free in g. Hence 
FE [8g] by the induction hypothesis, and so & F g[f]. 


Notation. A formula ¢ shall sometimes be denoted by 9(x),..., x,,) if 
the variables x; are distinct and all free variables in g belong to the x;. 


If a formula ¢(xj,...,xX,) and elements aj,...,a,, € A are given, one 
defines & F g[aj,...,a,] by & F gla], where a is an assignment with 
ct(x;) = a;, which is well defined by the previous proposition. 


Thus, g(x, ..., X,) defines an n-ary relation on the structure Y%, given 
by ¢[2] := {(ay,...,a,) EA” | WE glay,..., ay, ]}- 

In particular, when ¢ is a sentence, the relation 

AE 

can be interpreted as “9 is satisfied (or true) in X” or “2 is a model of ¢”. 
Example. An Lring-Structure (K;0,1,+,—,-) is a field if and only if it 
satisfies the field axioms (1)-(10) given previously. 
Definition. Let 2% be a structure and D C A”. 


* The set D is called Q-definable in & if D = ¢[2] for some for- 
mula 9(X), ...,X,). 

« Let B C A bea parameter set. Then D is called B-definable in & 
if there exist a formula P(X), ...,Xy,1.-->Ym) and b € B™ such 
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that D is equal to the set 
g[M,b] := {GE A™ | WE Gfay,...,y,b,,..., Dm}. 


Exercise 2.3.3. 


(1) Let 2 be an £-structure and B a non-empy subset of A contain- 
ing the interpretations c”™ of all constants in £ and which is 
closed under all operations f™. Restricting the interpretations 
of all symbols from the set A to B one obtains an £-structure 
%, called substructure of X&. We shall write B C 2% when B isa 
substructure of 2. 

Prove that the intersection of a family of substructures of 
& is either a substructure or empty. Deduce that any non- 
empty subset X of A is contained in a smallest substructure of 
%, called the substructure generated by X and denoted by (X)y. 
Prove that the base set of (X), is given by 


{t™[a] | te 7% anda: VX}. 


(2) Let & = (A;0,1,+,-,-) be an Zring-Structure. One assumes 
that & is a ring. Prove that the substructure generated by a 
subset X of A is the subring generated by X. 


2.4. Substitution 


The aim of this section is to provide a satisfactory notion for the substi- 
tution in a formula ¢ of a variable x by a term s, in such a way that the 
expected semantical properties (cf. Proposition are satisfied. 


One might simply decide to replace every free occurrence of x in 
by s, but it might then happen that some variable in s becomes bound by 
a quantifier in the resulting formula. This could have unpleasant con- 
sequences, as for the formula ¢(U,) given by 4v,;7v, = Up, with x = Ug 
and s = v,, for which one would get the formula 4v,7v, = v, which is 
satisfied in no structure, while as soon as 2% contains at least two distinct 
elements, %& F g[a] for anya € A. 


The following somewhat more complicated definition remedies this 
issue. 
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Definition. Let Xo,...,x, be distinct variables and spo,...,5, be terms. 
One defines the similtaneous substitution of the x; by the s; as follows. 


(1) Let t be aterm. Then ¢,,/.,..\s,/x, = g/g is the word obtained 
eds at lad 
by “simulatenously replacing all occurrences of x; in t by s;”, 
that is, one sets 
if x 4X, ...,X # Xp, 
SM S/e ; 
s, ifx=x;, 
° Cox = C, : 
- [ftl--t lag = fii, Hz, inductively. 
(2) By induction on the height of a formula, one sets 
> [t=t'];q equal tot; =t., 
1 1 
+ [Rt’ --- t"]5,z equal to R= +++ Hz, 
* [-]5q@ equal to >[~];,z, and finally 
* (PA syx equal to (P5/z A 5x). 
+ Let xj,,...,Xi, (i < .. < i,) be those variables among Xo, 
..,X, that are free in 4x7. (in particular one has 
X FX yey X F Xz) 
If x has no occurrence in any of s;,,..., 5;,, one sets 


[Axp sx equal to AxlY]s,, /Xi, seSig, Nip . 
If x has some occurrence in one of 5;,,..., S;,, one sets 
(*) [Axp]5z equal to SUD sy, iy wnSiy, Xi, tl 


where u is the first variable appearing in the enumera- 
tion Up, Uj, U2, ... Which does not occur in any of the words 
xy, Siz > Si, 
Exercise 2.4.1. Prove the following by induction on height: 


(1) Ift is a term, then t;,z is a term. 


(2) If g is a formula, then g5,; is a formula. Moreover, one has 
ht(g¢5z) = ht(¢). 


Let Xo,...,X, be distinct variables, a an assignment with values in 2 
and dp,...,a, elements of A. One defines the assignment aq, /x,,..a,/x» = 


Aayx bY Ag/x(Xi) = aj and agx(y) = a(y) ify # x; for every i. 
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Proposition 2.4.2 (Substitution Lemma). Let Xp, ..., x, be distinct vari- 
ables, So,...,S, terms and a an assignment with values in 2. 


(1) For every term t one has tla] = (% [se tcrhrxps.nstfeeVx, | 


(2) For every formula ¢ one has 


CY/X0 pues [OL]/Xp | : 


WE g5z[a] ifand only if AF [ax 


Proof. (1) is easy. One proves (2) by induction on the height of ¢, the 
only non-trivial case being when ¢ is of the form Ax. As in the defini- 
tion of [Axp];,z, one denotes by x;,,...,x;, the variables that are exactly 
those among the x; which are free in 4x. Assume first that x occurs 
in one of the terms Sips es Sigs and let u be the variable chosen as in (). 
Then 


AE [Axp]5<lo] —=> AF JulP|s,, xi, sonSig ig lx 
<> there exists a € A such that 2% F Psp, py woaSiyg/ iy sl [Cau] 
<> (by the induction hypothesis) there exists a € A such that 
DE F plore [ 
iy 


<=> there exists a € A such that & F pla 


a 
anally 984 [anal X40 arall | 


si [a\/xi, roS jp [a]/xi, diel 
(by Lemma since u does not occur in any of the sj, ) 

=> WE Axpla. 
y 


al/xi, sd [al/xi, ] 


=> 2 F Axel 5p erH/2¢9 958 fetl xp] 


The last equivalence follows from Proposition since the as- 


signments a,x and ox .x coincide on the set 
4Y 


Free(Axy). 


[a\/xi, jeep [o]/xi, [e]/X0,-S7[C]/Xp 


"k 
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Assume now that the variable x has no occurrence in any of the 
terms 5j,,..., Si,- Then 


AF [Axp|sela] => AF FLD s,, lai, wig Xi [cr] 

<> there exists a € A with & F Psp, 1, Sty Xp [ea/x] 

<=> (by the induction hypothesis and since x # x;,,...,x # X;,) 
there exists a € A with & FE pla 


DI DE 
SF otayel Xi, v8 [ard 0x! 


=> 2 F Axle [ol/xi, jig [a/xi, | 


ik 
=> DE F xp 6c} /29 5% [otl/xp > 


Example 2.4.3. 


(1) Let (up) be the formula 4v,7v, = Up, and let s = v,. Then 
Ps/vp is equal to Jv2[701 = Vo], /u9,v2/v,> thatis, to Fv, 72 = vj}. 


(2) Let g be a formula and x),..., x, be distinct variables. One as- 
sumes that s,,..., 8, do not contain any variables having an oc- 
currence in g. Then ¢;/; is the word obtained by simultane- 
ously replacing every free occurrence of x; by s;. 

We prove this by induction on ht(¢), the only non-trivial 
case being when ¢ is of the form 4xp. Let ij < .. < ix be 
chosen as in the definition of g;,;. By assumption, x has no 
occurrence in any of 5;,,...,5;,, and so by definition [Axd]5z 

equals Ax, 12jy v-S gg iy” As the Sij do not contain any vari- 

ables occurring in 7, Psi, [iy w-eSigg!Xiy is inductively given by si- 
multaneously replacing every free occurrence of Xi, by Si,» for 

j =1,...,k. This proves the result. 


Lemma 2.4.4. If y is a variable with no occurrence in ¢, then [Pyx].. fy is 


equal to 9. 


Proof. The proof is by induction on ht(¢), the only non-trivial case be- 
ing when 9 is of the form 4zy. If x is not free in g, then [Pyx], i equals 
Px/y by Example 2.4.3(2), and so is equal to g by definition, as y does 
not occur in ¢ at all. We may thus assume that x is free in Az, so in 
particular z # x and z # y. Hence by definition [[az~] yx], ” is equal to 
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[22lY] yx]. which is equal to 4z [TH] yx]. But [Yyx],./ equals ~ by 
the induction hypothesis. 


Notation. Let sj,...,5, be terms. If t(x),...,x,) is a term, we shall often 


geen Xp? So FAYL9 88 oO SEO ME TS LX gooey 


if g is a formula of the form 9(x),...,X,). 


2.5. Universally Valid Formulas 


Definition. An £-formula ¢ is called universally valid, written F 9, if it 
is satisfied in every £-structure %& and for any assignment a with values 
in Y, that is, if & F g[a] for every 2 and every a. 


Remark 2.5.1. The formula 9(x,,...,X,,) is universally valid if and only 
if the sentence Vx,,..., X, is universally valid. 


We call Vxj,...,X,g a universal closure of ~. We will write & F @ for 
WE VXy,...,XnP- 


Example. 


(1) 4x x = xis universally valid. (By definition, a structure is non- 
empty!) 

(2) Let gy and p be £L-formulas. Then (g > (W — ¢)) is universally 
valid. 


The fact that we do not mention the language £ in the notation F g 
is justified by the next lemma. First some terminology. 


Definition. Let £ C L£’ be two languages. If 2’ = (A; (2A yen) is an 
£'-structure, we set &% := A’ f, := (A; (2 rents the reduct of 2' to 
£C £’. The structure 2’ is called an expansion of & to L’. 


Lemma 2.5.2. Let pg bean £-formula and L£' D £. Then ¢ is universally 
valid as an £-formula if and only if it is universally valid as an £'-formula. 


Proof. One may identify assignments with values in 2 and those with 
values in 2’, and one has 2’ F gla] <> °F gla] for any assignment 
a. Thus it suffices to prove that any £-structure 2% has an expansion to 
some £’-structure, which is clear. 
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We now specify a framework for formulas like (g —- (~ > 9)) 
which are always satisfied independently of the truth values of g and 
of ~, namely propositional calculus. 

One fixes a set P = {p; | i © N}, where the p; are called proposi- 
tional variables (they will take only “true” or “false” as values). 

Propositional calculus formulas are defined as words over the alpha- 
bet P U{-, A, (,)}, formed according to the following rules: 


* p; is a formula for any i; 


« if F and G are formulas, then -F and (F AG) are formulas, too. 


Let Fml» denote the set of propositional calculus formulas. As be- 
fore we introduce the abbreviations V, > and «. 

One identifies Z/2 = {0,1} with {“false”, “true”}, by assigning 0 to 
“false” and 1 to “true”. An assignment is a function 6 : P > {0, 1}. 


Any assignment 6 induces a function 6* : Fmlp — {0,1} defined 
by 5*(p;) = 5(p;), 6° (AF) = 1 — 6*(F) and 6*((F A G)) = 6*(F) - 6*(G), 
inductively. If 6*(F) = 1, we also write 6 F F. 

One writes F = F(qj,..-, Qn) if the q; are distinct variables and all 
variables occurring in F belong to the qj. 


Exercise 2.5.3. For any function g : {0,1}" > {0, 1} there exists a propo- 
sitional calculus formula F(pj,..., p,) such that, for any assignment 6, 
one has 6*(F) = g(6(p)), ..-,O(pn))- 


Definition. 


(1) One says that a formula F € Fml, is a tautology for the propo- 
sitional calculus if 6 — F for any assignment 6. This can be 
rephrased by saying that F = F(qj,..., q,) is true for any choice 
of truth values for the q;. 


(2) An £-formula ¢ is a tautology for the predicate calculus if there 
exists F = F(qj,..., Qn) a tautology for the propositional calcu- 


gieay 


(the word obtained by replacing the variables q; by yj). 


Lemma 2.5.4. Tautologies for the predicate calculus are universally valid 
formulas. 
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Proof. Clear. 


Lemma 2.5.5 (Equality axioms). The following sentences are universally 
valid: 


¢ VU9 Up = Uo (reflexivity, E1) 
© WUo, U1 (U9 = Vy > Vy = Uo) (symmetry, E2) 
© Wo, Uj, U2((Ug = V1 A Vy = U2) > Up = U2) (transitivity, E3) 
n 
¢ V1, ..., Van (Aja Uj = Vian > FO, Un = fUngi U2n) 
forany f € Ff (congruence for functions, E4) 
n 
° Vuy,...5 Von((A jer UV; = Vigan A RV, +++ Uy) @ RU_41 Urn) 
for any R € Rf (congruence for relations, E5). 


Proof. Clear. 


Remark. The sentences (E1-E5) express that = is a congruence relation, 
that is, an equivalence relation compatible with the functions and the 
relations of the language. 


Lemma 2.5.6 (Quantifier axioms). Let y and wp be £-formulas. 
(1) For every variable x which is not free in y, the formula 
(Q1) Vx(p > ~) = (> Vx) 
is univerally valid. 
(2) For every variable x and every term t, the formula 
(Q2) Prix > Ax 
is universally valid. 
(3) For every variable x the formula 
(Q3) Ax 9 @ AVx ay 


is universally valid. 


Proof. (2) follows easily from the Substitution Lemma (Propo- 
sition 2.4.2), and (3) is clear. 

We will now prove (1). Suppose that x ¢ Free(¢), and that & F 
Vx(~ > p)[a], that is, & F -Ax (g A 7)[a], from which we infer 2 F 
(7g V P)[eq/x] for any a € A. We have to prove that &F (g > Vx y)[a], 


Licensed to Mathematisches Forschungsinstitut. Prepared on Thu Sep 30 04:18:29 EDT 2021for download from IP 188.1.230.74. 
License or copyright restrictions may apply to redistribution; see https://www.ams.org/publications/ebooks/terms 


48 2. First-order Logic 


and for this we may assume that & F ga]. But then & F g[ag,/,] by 
Proposition as x ¢ Free(¢), and so & F p[aq/x]. Since a € A was 
arbitrary, 2 F Vx ~[c] follows. 


Remark. The restriction on the variable x in (Q1) is necessary, as shown 
by the following example. Let both g and wp be equal to the formula 
x = c, where c is a constant symbol. Then F Vx(g > y), but in any 
structure 2 with a base set that contains at least 2 elements, one has 
WE Vx(~ > Vx). 


Definition. An £-theory is a set of £-sentences. 
Let T be an £-theory. 
* One says that the £-structure 2% is a model of T, which we de- 
note by &F T, if UF g for any g € T. 


« A sentence (or more generally a formula) ¢ is a logical conse- 
quence of T, denoted by T F 9, if for any £-structure 2% which 
is a model of T we have & F ¢ (that is, in case of a formula 
(X1,.-,Xp,), We require M& F Vxy,...,X), Y). 


By the proof of Lemma the fact that T F ¢ is independent of 
the language L. 


2.6. Formal Proofs and Gédel’s Completeness Theorem 


In this section we will prove Gédel’s Completeness Theorem, which 
states that any universally valid £-formula may be obtained using a 
finite deduction (a formal proof). This fundamental result establishes a 
perfect correspondence between semantic truth and syntactic provabil- 
ity in first-order logic. 

We will start by formalizing the notion of proof. 


By the logical axioms we mean: 


* predicate calculus tautologies, 
* the equality axioms (E1-E5) from LemmaP.5.5, 
* the quantifier axioms (Q1-Q3) from Lemma P.5.6 


There will be two deduction rules: 
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* (MP) Modus Ponens: From ¢ and g —> #, one can deduce 2. 


* Generalization: From ¢, one can deduce Vx (x any variable). 


Definition. 


(1) Let g be an £-formula and T be an £-theory. A formal proof 
of pin T is a finite sequence of £-formulas (¢,..., ~,) with 9, 
equal to g such that, for every i < n, either gy; € T, or 9; isa 
logical axiom, or it can be deduced by (MP) from some gj; and 
9, with j,k < i, or it can be obtained by generalization from a 
formula ¢; with j <i. 

(2 


New 


One says that ¢ is provable in T, denoted by T F, 9, if there 
exists a formal proof of g in T. We write, g if g is provable 
in the empty theory. 


A priori, F, depends on the language £ in which the formula @ 
and the theory T are considered, since if £’ D L£, there are more L’- 
proofs than £-proofs. However we shall see that T F 9 if and only if 
T Fg, 9, which shows in particular that the provability of a formula is 
independent of the language. 


In the sequel, we shall consider proofs (p, ..., @,) in which formulas 
y; whose provability has been previously established will be considered 
as axioms. It is always possible to transform such a proof into a formal 
proof; it is enough to replace g; by a formal proof of 9;. 


Example 2.6.1. 
(1) If gy and > are £-sentences, then {g, p} Fk, PA w. 


Now let g be an £-formula, x, y variables and t an £-term. Then we 
have the following: 


(2) Fe Vx > Pryx- 
(3) If y does not occur in g, then F, Vy gy/x > Vx 9. 
(4) Fy Vx~ > 9. 
Proof. (1) From ¢ and the tautology gy > (~ > (¢ A p)) one deduces 


wy = (pA) by (MP), from which one deduces gAy, using ~ and applying 
(MP) again. 
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(2) Here is a formal proof of Vx ¢ > ¢;/, (recall that by definition 
Vx y is the formula =Ax7¢): 


Po equals (“4/x > 4x79) (Q2) 
91 equals (3px > X79) > (AXP > x) (tautology) 
$2 equals Vx P > P4/x (MP). 


(3) The formula Vy gy), = [yx] is an instance of (2). Since 


x/y 
[Pyx], jy equals g by Lemma we get Vx (Vy Py/x > ) by general- 
ization, from which we conclude by the quantifier axiom (Q1) and (MP). 


(4) is a special case of (2), since ¢ is equal to 9,,,. 


Lemma 2.6.2 (Soundness). Let T be an £-theory and 9 an £-formula. 
Then 
Tre e>TF¢. 


Proof. The logical axioms are universally valid by Lemma Lemma 


2.5.5,and Lemmaf.5.6, Clearly, if T F gand TF g > yp, then T F y. Itis 
also easy to see that T F g implies T F Vx g. By induction on the length 


of a formal proof, we are done. 


Definition. Let T be an £-theory. 
* T is inconsistent if there exists an L-sentence ¢ such that Tk, 
gand Tk, 79. 
* T is consistent if it is not inconsistent. 
« Tis complete if it is consistent and for any £-sentence ¢ either 
Tk, porT Fs 79. 
Example 2.6.3. 
(1) Let 2& be an £-structure. Then the set 
Th(2l) := {y is L-sentence | 2F ¢} 
is a theory, called the theory of &. It is a complete theory (its 
consistency follows from Lemma P.6.2). 


(2) The theory ofalgebraically closed fields is the £,jng-theory ACF 
which is composed of the field axioms together with a sentence 
Xp for any n > 1 expressing that any polynomial of degree n has 
aroot. For instance, the formula y,, given by VZp.....2,-1 IX(x"+ 
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Zy—1X"!+...+ Z = 0) works; that formula can be rewritten in 
Kring: 
Remark 2.6.4. 
(1) An £-theory T is inconsistent if and only if T +, g for any 
£-formula ¢. 
(2) If T kg 9, there exists Ty C T finite such that Ty Fg 9. 


Proof. Exercise. 


Corollary 2.6.5. 
(1) Let T be an £-theory such that all finite subsets of T are consis- 
tent. Then T is consistent, too. 


(2) Let (Tj)jer be a family of consistent £-theories, with T; C Tj or 
T; C T; for anyi, j € I. Then T = her T; is consistent, too. 


Lemma 2.6.6 (Deduction Lemma). Let 7 be an £-sentence, T an L- 
theory and y an L-formula. Then 


TU{z}Fe vifandonlyifT ty, xy > gy. 


Proof. Itisclear that T+, y > yimplies that TU{y}, y. Conversely, 
let (po, ..-, Pn) be a formal proof of gy in TU{y}. By induction on i, we shall 
prove that TF, (vy > gj). If g; equals y, this is clear, andif T F¢ gj, it 
follows from (MP) and the fact that (9; ~ (vy > 9;)) is a tautology. This 
implies that T+, (vy > ¢;) when ¢; is a logical axiom or an element of 
T. 


If g; is deduced by (MP) applied to g and 9, for j,k < i, so 
equals g; — gj, itis enough to use (MP) together with the fact that ((y > 
RAK > G > Gi) > % > ¢})) is a tautology. 

Finally, if g; is obtained by generalization from gj; for j < i, so 9; 
equals VxQ;, we have Tk, y > Qj by induction, from which we deduce 
T Fy Vx(x > g) by generalization. Now, y being a sentence, the 
formula Vx (vy - ¢) — (vy — Vx) isa quantifier axiom, so we conclude 
that TF, y > Vx by (MP). 


Corollary 2.6.7. Let T bean £-theory and pan L£-sentence. ThenT kg g 
if and only if T U {79} is inconsistent. 
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Proof. The ‘only if’ part is clear. Conversely, if T U {=@} is inconsistent, 
then T U {7-9} Fg ¢ by Remark 2.6.4(1). It follows from the Deduction 
Lemma that T +, 7g —> 9. Since (-g > ¢) > gis a tautology, we 
conclude by (MP). 


Lemma 2.6.8 (Simulation of constants by variables). Let p~ be an L- 
formula, T an £-theory, and let C be a set of constant symbols such that 
L£NC=8. 


(a) Let x bea variable and c € C. The following are equivalent: 
CQ). ree 
(2) T F cute} Depxs 
(3) T F cute} yp. 

(b) Onehas Tk, p ifand only if T Fuc wv. 


Proof. For ease of notation we shall assume T = 9; the proof for general 
T is just the same. Let us start by proving (a). 

(1)>(3) is clear since any £-proof is an £ U {c}-proof. 

(3)>(2). If Feu , then generalization yields F-4.; Vxp. Using 
Example 2.6.12) and (MP), one deduces F pyre} Pex 


(2)>(1). For ¢ an £ U {c}-formula and y a variable not occurring 
in @, we denote by ¢ and also by ¢,,/, the word obtained by substituting 
every occurrence of c in ¢ by y. 


Let ¢, p and ¥ be L u {c}-formulas with no occurrence of y. Then 
the following statements hold: 
(i) gis an £-formula. 
(ii) Poy is equal to ¢. 
(iii) If @ is an equality axiom (respectively a tautology or a quanti- 
fier axiom), then ¢ is so, too. 
(iv) If ¢ is obtained using (MP) from # and Z, then ¢ is obtained 
using (MP) from w and y. 
(v) If Gis obtained using generalization from #, then ¢ is obtained 
using generalization from ~. 


One proves (i) by induction on the height of ¢, and (ii) follows from 
Example 2.4.3(2). The statements (iv) and (v) are immediate, and also 
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the case of an equality axiom, a tautology or an instance of (Q1) or (Q3) 
in (iii). It remains to check the case of an instance of (Q2) in (iii): If ¢ 
is of the form (6; Ix > 4x65), one has to prove that the formula g, which 
is given by ([5/x]ye > Jx[5]y/c), is an instance of (Q2). This follows 
from the fact that [6, ixlyje and [Syiclt, jeix are equal, which is proved by 
induction on height. 


Now, let (¢°, ..., $) be a formal proof of ,,, in the language £ U{c}. 
Choose a variable y not occurring in any of @°,...,@’" and such that y # 
x. 


One deduces from (i)-(v) that (¢°, ...,¢’") is a formal proof in the 
language L of [~, il, jo Which is equal to ty/. by Example B.4.3(2). 

By generalization one gets +, Vy yx. Using Example 2.6.1(3,4) 
and (MP), one gets, ~, which completes the proof of (2)>(1). 

Item (b) is a direct consequence of (a). Indeed, as any £ U C-proof 
involves only a finite number of elements of C, one may assume that 
C is finite. One concludes by induction on the cardinality of C, using 


(1)$(). 


Lemma 2.6.9. Let T be an £-theory, p(x) an £-formula andc € La 
constant symbol not occurring in T U {p(x)}. Assume that T is consistent. 
Then T U {Axp > 9,/x,} is a consistent £-theory. 


Proof. Ifthe conclusion does not hold, then T Fk 4x pA7¢G,/ by Corol- 
lary 2.6.7, In particular, T+, 4x y, and, by LemmaP.6.8(a), TF, 79, 
so T +, Vx 7@ by generalization. Using the quantifier axiom (Q3) and 
(MP), it follows that T is inconsistent. 


Theorem 2.6.10 (Gédel’s Completeness Theorem). Let T be an L-theory 
and gy an £-sentence. Then T F 9 if and only if T F, 9. 


Remark. Since F does not depend on the language 4, a posteriori F, 
does not depend on £ either. Thus, once Gédel’s Completeness Theorem 
is proved, we shall write + instead of F;. 


Proof. T+, ¢=>T FE 9. This is the easy direction in Gédel’s theorem 
and it expresses that our notion of formal proof is sound. It holds by 


Lemma 2.6.2. 
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TE@=>T FE, 9. This is the non-trivial direction. It will follow 
from the next theorem, which is an existence statement for models. In 
fact, as 


TEo <— TU {79} has a model (by definition) and 
TK, @ <— TU {79} is consistent (by Corollary 2.6.7), 


this theorem is equivalent to the Completeness Theorem. 


Theorem 2.6.11. A theory has a model if and only if it is consistent. 


Proof. The easy direction was already proved. To prove that any con- 
sistent theory T has a model, we will start by constructing an expansion 
Tt of T in some language £*+ D L which has better properties. Let us 
begin with a definition. 


Definition. Let £ be a language and C a set of constant symbols with 
£AC =. One says that an £ U C-theory T+ admits Henkin witnesses 
in C if for any £ U C-formula g(x) there exists c € C such that xg > 
Pelx € ae 


If & is an L-structure and A = {a, | c € C}is an enumeration 
(possibily non-injective) of its base set by C, one denotes by At the LU 
C-structure obtained from % by interpreting c by a,. Then Th(2t) is 
a complete theory which admits Henkin witnesses in C. In fact, any 
complete theory which admits Henkin witnesses in C is of this form: 


Proposition 2.6.12. Any complete L UC-theory T* which admits Henkin 
witnesses in C has a model A+ consisting of constants of C, that is, with a 
base set of the form At = {c™ | ce Ch. 


Proof of Proposition 2.6.12. We observe that replacing T+ by the set 
of £ U C-sentences g such that T* Fy yc g does not change the as- 
sumptions. We may thus assume that T* is deductively closed, that is, 
Tt Fyyc vif and only ifg € Tt. 

Note that it follows from the equality axioms that the binary relation 
c~d: — c=d €  T* onC isan equivalence relation. We set 
a, :=c/~and At := {a, | c € C}. Then At # G, as clearly C # @. Let 
us define an £ U C-structure %+ on At as follows: 
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* For d a constant symbol, we set dw := a. ifc = d € T*, 
where c € C. Note that such a c always exists. Indeed, d = 
d > Axx = d € Tt by (Q2), thus dxx = d € Tt by (MP); 
since T* admits Henkin witnesses in C, there existsc € C such 
that xx =d > c=d€ Tt, whencec = d € Tt by (MP). 
Furthermore by the equality axioms d*” does not depend on 
the choice of c. 


- For R € RX we set 
(ie sande) ER : > Re, ---¢, € Tt. 
This is well defined by the equality axioms (E5). 
+ For f € F% we set 
+ 
PP Gast JSG, 1 SS fre, = Ge eT. 


This is well defined by the equality axioms (E4). By an argu- 
ment similar to the one given for constants one proves that f 4? 
is defined everywhere as a function. 


The following statements then hold: 
(I) Ift is an £ U C-term without variables and c € C, then 
mh -a, — t=ceTt. 
(II) Let » be an £ UC-sentence. Then At FY <> PET. 


One proves (I) by induction on ht(¢), using the equality axioms. 


To prove (II), first recall that ht() = ht(?5z) (cf. Exercise [2.4.1]. 
One argues by induction on ht(7). 

If ~ is of the form f, = fy, then (II) follows from (1). If p is of the form 
Rt, --- t,, one chooses c; € C such that i = a,,. Sincet; = c; € T +, one 
hasat ep —> Rec, ETT <> PET. The first equivalence 
follows from the definition of 2+, the second from (E5). 

If p is equal to (vy, A ¢2) it is clear that (II) for ~ follows from (II) for 
both ¢, and ¢3. If p is equal to =¢g, one has 


ATED SAFO —> GETt |S peET*, 


since Tt is complete and deductively closed by hypothesis. 
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Finally, suppose that ~ is equal to 4x. Let us note that » € Tt if 
and only if there exists c € C such that 9,/;, € T*. Indeed, since Tt 
admits Henkin witnesses in C, the ‘only if’ part holds. For the ‘if’ part, 
it is enough to use (Q2), (MP) and the fact that T+ is deductively closed. 
One may now conclude as follows: 

AteEp <> At F ¢g[a,] for somec EC 
(Substitution Lemma) <> 2+ F g,/, for somec € C 
(Induction Hypothesis) <=> 9,,, € T* for some c € C 


=—> pelt. 


Thus %&+t is a model of T* consisting of constants in C. 


The following lemma will be enough to conclude the proof of Theo- 
rem 2.6.11} because if 2+ is a model of the £+ = £ UC-theory Tt with 
Tt 2 T, then the reduct of 2+ to the language £ is a model of T. 


Lemma 2.6.13. Forany consistent £-theory T there exists a set of constant 
symbols C disjoint from £ such that if £+ denotes the language LUC, then 
there exists a complete £+-theory T+ containing T that admits Henkin 
witnesses in C. 


Proof. To any £-formula with a free variable g(x) one assigns a new 
constant symbol cg. Let C; be the set ofall cg. One sets £; = LUC, and 


T, :=T :=Tuaxg > Peg/x | (x) © Fml*}. 


Let us prove that the theory T, is consistent. Since a theory is consis- 
tent if and only if any finite subset is consistent, it suffices to prove that 
the theory T’ = Tu {ax;g! = Pe /x,} is consistent for any finite set of 


formulas {¢',...,@"}. The theory T being consistent as an £,-theory by 
Lemma P.6.8(b), this follows from Lemma by induction on n. 


We iterate that construction, with C, a set of new constant symbols, 
£Ly := £,UC, and the £3-theory T, := “ee By the argument just given, 
T, is consistent. By induction, T,, being constructed, it gives rise to a set 
of new constant symbols C,,,,. One sets £y4, = £, U Cy4, and the 
Ly41-theory Ty41 := Ty is consistent. Set C := seer, C, and £t+ := 
LUC. Thus (T,)nen is an increasing sequence of consistent £* -theories. 
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The £*-theory T’ := U,,<a, Tn is then consistent by Corollary 2.6.5, By 
construction, it admits Henkin witnesses in C. 


Since any £+-theory S’ containing T’ still admits Henkin witnesses 
in C, it suffices to prove that any consistent £+-theory is contained in 
some complete £+-theory. This follows from Zorn’s Lemma. Indeed, 
the set 


S§ = {S' DT’ | S’ isaconsistent £t-theory} 


is non-empty and partially ordered under inclusion. As the union of a 
chain of consistent theories is still consistent by Corollary this or- 
dering is inductive. Hence, by Zorn’s Lemma, there exists a maximal 
element in S, that is, there exists an £+-theory T+ D T’ which is maxi- 
mal consistent. Let g be some £t-sentence. If T+ K+ o, the £+-theory 
T* U {79} is consistent by Corollary One deduces that -g € T* 
by maximality, hence T* is complete, which finishes the proof. 


When £ is countable, one can provide a more direct construction 
for the theory in the lemma that we shall now sketch. One fixes a set 
C = {c, | n € N} of new constant symbols and one enumerates the 
set of £L U C-sentences via $9, %),$2,.... By induction, one constructs 
an increasing sequence (T,,) nen of consistent £ U C-theories with the 
following properties: 


* Ty) = T, and T,, \ T is finite for any n € N; 
© On © Thy1 OF APy € Th 413 


- if y, is of the form 4x € T,,,1, then there exists c € C such 
that Welx E Ths1- 


Once such a sequence (T;, nen is constructed, it is enough to set Tt := 
Ge en Jn: Then Tt is a complete theory by construction, and it admits 
Henkin witnesses in C. To prove the latter, consider some £UC-formula 
w(x). Then there exists some n € N such that Axyw is equal to ¢,. If 
Gn € T*, then by construction ~,/, € T+ for some c € C, hence Ax > 
Yoix € T*. Otherwise, one has =Axp € Tt, hence Axp > yp), € Tt 
for any c. 

Assume T,, is constructed. If T,, U{g,,} is consistent, one sets i, equal 


to g,. Otherwise, i, is set equal to =¢,,. In both cases, the theory T,, U 
{p,} is consistent. Ifi,, is not of the form 4xy, one sets T, 4; := T,U{Py}- 
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Otherwise, let m be minimal such that c,, has no occurrence in T,, (such 
an m exists since T, \ T is finite). One sets T,41 >= TrU{Wn» Xc,,/x}- The 
theory T,41 is consistent by Lemma 2.6.9, 


2.7. Exercises 


Exercise 2.7.1 (Compactness in propositional logic). Let Fmlp be the 
set of propositional formulas over an infinite set P of propositional vari- 
ables. A set 2 C Fml» is called satisfiable if there is an assignment 


6 : P > {0,1} such that 6*(F) = 1 for all F € 2%; it is called finitely 
satisfiable if every finite subset Xp C 2X is satisfiable. 


The aim of this exercise is to prove the following result (Compact- 
ness Theorem in propositional logic): A set & C Fml» of propositional 
formulas is satisfiable if and only if it is finitely satisfiable. 


(1) Prove the theorem in the special case when X is complete, that 
is, when for every p € P one has p € Lor7p € X. 


(2) Prove that every finitely satisfiable set of propositional formu- 
las is contained in a finitely satisfiable set which is complete. 


(3) Conclude. 


Application 1. Let k be a natural number. A graph G = (G, E£) is said to 
be k-colorable if there is a coloring of its set of vertices G with k colors 
such that any two vertices which are connected by an edge have differ- 
ent colors. Prove that a graph G is k-colorable if and only if every finite 
subgraph of G is k-colorable. 


Application 2. For n € N and a set A let #,(A) be the set of n-element 
subsets of A. 


(a) Prove Ramsey’s Theorem (infinite version): For any natural 
numbers n,k and any f : %,(N) > {0,1,...,k — 1} there is an 
infinite subset A of N such that the restriction of f to R,(A) is 
constant. 

[Hint: Argue by induction on n € N. Given some function f : 
P,44(N) > {0,1,...,k—1} and some natural number a, consider 
the induced map f, on P,(N \{a}) defined by f,(S) = f(SU{a}). 
Inductively construct an increasing sequence 0 = dg < a, < 
a, < --- in Nand a sequence of infinite sets N = Xp 2 X, 2 
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X, 2 -:- such that for every i one has a; € X;, Xj4, N[0,a;] = G 
and fy, is constant on #,(X;41).] 


(b) Deduce Ramsey’s Theorem (finite version): For any natural 
numbers m,n, k there is a natural number N such that for any 
function f : %,({0,1,...,N —1}) > {0,1,...,k — 1} there is an 
m-element subset A C {0,1,..., N — 1} such that the restriction of 
f to B,(A) is constant. 


Exercise 2.7.2 (Resolution in propositional logic). We keep the context 
of the previous exercise. Let F,G € Fmlp. We say that F and G are 
logically equivalent if (F < G) is a tautology. A literal is a formula of 
the form p or ap for some p € P; a clause is a disjunction of literals. 
By convention, the empty clause, denoted by [J is a formula which is 
always false. 


Observe that any formula F is logically equivalent to a conjunction 
of clauses. 


¢ Ifa literal L occurs more than once in a clause C, a clause D 
obtained by deleting one occurrence of L in C is called a sim- 
plification of C. 


« Given clauses C = C, V pV C, and D = D, V =p V D,, where 
p € P, we say that E = C, V C, V D, V Dy is obtained from C 
and D by the cut rule. 


« A resolution sequence from a set of clauses C is a finite sequence 
of clauses (Co,...,C,,) such that for any i < n either C; € C, or 
C; is a simplification of C; for some j < i, or there exist j,k <i 
such that C; is obtained from C; and C, by the cut rule. 
We say C is refutable if there is a resolution sequence from 
€ ending in [1]. 
The aim of this exercise is to prove that a set of clauses is satisfiable if 
and only if it may not be refuted. 
(1) Prove that the resolution method is sound: if a set of clauses 
may be refuted, then it is unsatisfiable. 


(2) Observe that a clause C is a tautology if and only if there is a 
propositional variable p such that C contains the literals p and 


ap. 
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(3) Let €C be a set of clauses and p € P. If C is a clause, let Cp=0 
be the clause obtained from C by deleting all literals which are 
equal to p. Consider 


Cp=0 = {Cp=0 | C does not contain the literal +p}. 


Similarly define Cp=1 and Cp=1, interchanging the roles of =p 
and p. 
(a) Prove that € is satisfiable if and only if C,-» or Cp, is 
satisfiable. 
(b) Prove that C,—o is refutable if and only if there is a reso- 
lution sequence from @ ending in [ or in p. 
(c) Prove that @ is refutable if and only if both C,_» and Cp-; 
are refutable. 


(4) Prove that the resolution method is complete: ifa set of clauses 
is not satisfiable, it may be refuted. 


[Hint: Prove the result for finite sets of clauses by induction on 
the number of variables, and then use compactness (cf. Exer- 
cise 2.7.1]) to deduce the general case from this. ] 


Exercise 2.7.3 (Interpolation). Let y be an £,-sentence and let # be an 
£5-sentence such thatF g > wp. Set Lp := £,N Ly. The aim of this 
exercise is to prove Craig’s Interpolation Theorem: There exists an Lo- 
sentence @ such that F p > OandF 6 = yp. 


We may assume that £, and £, are countable. Assume for contra- 
diction that no such @ exists. Let C be a countably infinite set of new 
constant symbols and set £7 = £; UC, fori € {0,1,2}. For an £f- 
theory T and an £3-theory U we say that the pair (T, U) is inseparable 
if there is no £3 -sentence 6 such that T F 6 and UF 76. 


(1) Prove that the pair ({¢}, {-~)}) is inseparable. 
(2) Prove that if (T, U) is inseparable and y is an pis -sentence, 
then either (T U {y}, U) or (T U {7}, U) is inseparable. 


(3) Construct an increasing chain of finite £}-theories (T, )nen with 
Ty = {y} and an increasing chain of finite Lf -theories (U;,)nen 
with Up = {7p} such that, letting T, = Une Tr and U, = 
Unen Un: the following conditions hold: 
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(i) T,, is a complete deductively closed £}-theory which ad- 
mits Henkin witnesses in C, 
(ii) U, is a complete deductively closed £3 -theory which ad- 
mits Henkin witnesses in C, and 
(iii) the pair (T,,, U,) is inseparable. 
Moreover, deduce from (i)-(iii) that T,, N U,, is a complete 
£4 -theory which admits Henkin witnesses in C. 


(4) Prove that T,, U U,, has a model, and conclude Craig’s Interpo- 
lation Theorem. 


(5) Deduce Robinson’s Consistency Theorem: If T; are consistent 
£;-theories fori = 1,2 such that Ty = T, N T> is a complete 
£o-theory, then T, U T, is consistent. 


Exercise 2.7.4 (Beth Definability). Let £ be a first-order language and 
R ¢ £an n-ary relation symbol. Let T be an £ u {R}-theory. 


* We say T implicitly defines R if for any £-structure N with base 
set M and any n-ary relations R,,R, C M” such that (M, R;) 
and ($M, Ry) are both models of T, one has R,; = Rp. 


« We say that T explicitly defines R if there is an £-formula 
9~(X1,..,X,) Such that T F Vx(R(X) © ¢(x)). 


The aim of this exercise is to prove Beth’s Definability Theorem: The 
theory T implicitly defines the relation R if and only if it explicitly defines 
R. 


(1) Prove that if T explicitly defines R, then it implicitly defines R. 


(2) Assume that T implicitly defines R. Let R’ be a new n-ary rela- 
tion symbol and c),...,c, new pairwise distinct constant sym- 
bols. 

(a) Let T’ be the theory obtained from T by replacing every 
occurrence of R by R’. Observe that 


TUT’ EF R(qq,.05 Cy) > R' (Cy, 5 Cn) 


(b) Prove that there is a finite conjunction g of sentences in T 
such that 


DA R(Cqy Cy) EG’ > R'(Cq, 5 Cn)s 
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where ¢’ is the formula obtained from ¢ by replacing ev- 
ery occurrence of R by R’. 
(c) Prove that T explicitly defines R. 


[Hint: Use Craig’s Interpolation Theorem (see Exercise 
2.7.3). 


Exercise 2.7.5. If w is a word over the alphabet 2, a subword of w isa 
word u such that w = w,uw, for certain words w), w>. 


By induction on the height ofa formula ¢, one defines the set Sub(¢) 
of all subformulas of g as a subset of the set of its subwords. If g is atomic, 
then Sub(y) := {g}; if gp equals —~ or Jv; ~, then Sub(y) : = Sub()U{¢}; 
finally, if g equals (1 A #2), then Sub(y) : = Sub(%,) U Sub(q,) U {¢}. 


Now let T be an £-theory. Two £-formulas g(x) and ¢2(x) are said 
to be equivalent in T if T F Vxy,...,Xn(1 © $2). They are called logically 
equivalent if they are equivalent in the empty theory. 


Assume that g(x) is an £-formula and that (y,..., ¥,) is a subfor- 
mula of g. Let ~’ be equivalent in T to =, and assume that Free() = 
Free(?’) = {,..,¥m}. Let y’(x) be the formula one obtains when one 
replaces the subformula y in gy by ~’. Show that g(x) and ¢’(x) are then 
equivalent in T. 


Exercise 2.7.6 (Herbrand normal form). Let £ bea language. A formula 
gy is said to be in prenex normal form if it is of the form 


Qx1 see QmXm¥o: 


where Q; € {5, V} for 1 < i < mand @ is a quantifier-free formula. It is 
called existential if it is in prenex normal form and does not contain any 
universal quantifier. 


(1) Prove that for every £-formula g(x) there is an £-formula (x) 
in prenex normal form which is logically equivalent to g(x), 
that is, such that F Vx(g « yp). 


The language £” is obtained from L by adding, for every £-formula 
P(xX1,-.-,X,) with n free variables, a function symbol f, of arity n. 
If p(x,,...,X,) is an £-formula in prenex normal form, one defines its 
Herbrand form g” (xj, ..., X,) by induction on the length of ¢: 


Licensed to Mathematisches Forschungsinstitut. Prepared on Thu Sep 30 04:18:29 EDT 2021 for download from IP 188.1.230.74. 
License or copyright restrictions may apply to redistribution; see https://www.ams.org/publications/ebooks/terms 


2.7. Exercises 63 


+ if (x),...,X,) is quantifier-free, one sets g”“(x,,...,X,) equal 
to 9(X}1,..., Xn); 

© if p(xj,...,X,) is of the form AypP(y, x,...,x,), one sets 
o" (X1, ..,X,) equal to dyp"(y, x1, ..,X,)3 

© if g(xj,...,X,) is of the form Vyp(y, xj,...,X,), one sets 
Ge Kip wngt, sequal toh" [5 Chg xin Xe) Mig hee Ra) 


(2) Let g be in prenex normal form. Prove that gk 9”. 


(3) Prove that for every £-formula g(x},...,x,,) in prenex normal 
form and for every £-structure M, there is an LY -expansion 
m* of M such that for every (Q),..,a,) € M", ifm F 
g[ay,...,Ay] then MM” KF —*[ay,..., ay]. 


(4) Let g be an L-sentence in prenex normal form. Prove that 
+ gy” if and only if g. In other words, there is an explicit 
recipe to construct an existential sentence in an expansion of 
the language which is universally valid if and only if the origi- 
nal sentence is universally valid. 


Exercise 2.7.7. Let £ be a first-order language containing at least 
one constant symbol, and let ¢ be an existential £-sentence, given by 
Ax,,...,X,, with p(x,...,x,) a quantifier-free formula. Prove that F g 
if and only if there are m > 0 and £-terms GH )i<i<n,1<j<m Not containing 
any variable such that 


EV ved...) 


l<j<m 


Exercise 2.7.8 (Omitting Types Theorem). Let T be an £-theory and 
TZ = N(Uj,...,U,) a set of £-formulas such that the free variables of the 
formulas in z belong to {vj,...,U,,}. One says z is a partial n-type (in 
T) if for any finite subset {9,, ..., 9} of z, the theory T U {3u jae gi} is 
consistent. 


* One says a model M of T realizes z if there exists a € M” such 
that IN F y[a] for any gy € z; otherwise, one says that M omits 
1. 
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« z is called isolated if there exists a formula (vj, ...,v,) with 
T U{du¢@} consistent such that T + Vu(g > ?) for any formula 
per. 


The aim of this exercise is to prove the following result, which is 
called the Omitting Types Theorem: Let £ be countable and T a consistent 
£-theory. One assumes that, for j EN, 7 = 2(vj,..., Un;) is a non-isolated 
partial type. Then there exists a countable model of T that omits all the 7;. 

We consider T and (7;) je as in the statement of the theorem. Let 
C = {c; | i © N}bea set of new pairwise distinct constant symbols and 
set L* = LUC. 


(1) Let po) be an £*-formula, and let T’ > T bea consistent L*- 
theory such that T’ \ T is finite. Prove that there exists c € C 
such that T” = T’ U {Aup~ > p(c)} is consistent. 

(2) Letc = (Cis enix _) be some n,-tuple of distinct elements of C, 

J 
and let T’ D T be aconsistent £*-theory with T’ \ T finite. 
Prove that there exists y(v),..., Unj) € 7; such that T” = T’u 
{=¢(c)} is consistent. 
[Hint: Use that, up to logical equivalence, T’ is given by T U 
{y(c, d)}, where y(v4,..., Unj> y)is some £-formula and da tuple 
of constant symbols in C disjoint from c.] 

(3) Prove that there exists a consistent £*-theory T* D T satisfying 
the following properties: 

(H) For every £*-formula (up) there exists some c € C not 
appearing in = such that Jupp > P(c) € T*. 
(O) For any j € N and any n,-tuple (¢;,,...,¢;,_) of distinct 
J 
elements of C there exists a formula g(v) € 7; such that 
79(Ci,; Seis Cin,) ET*. 
[Hint: Using suitable enumerations, construct T* as the union 
of an increasing chain (T,,)nexy with Ty = T and T,, \ T finite 
for every n.] 

(4) Let T* be a consistent £*-theory satisfying (H). Prove that for 
anyi EN, the set Xj = {j EN | T* Fc; = c} is infinite. 

(5) Conclude the Omitting Types Theorem. 
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Chapter 3 


First Steps in Model 
Theory 


Introduction 


First-order model theory deals with the relationship between theories in 
a first-order language and their models. It provides a way to treat prob- 
lems from other areas of mathematics (like algebra or combinatorics) 
with tools from mathematical logic. First-order logic has a rather lim- 
ited expressive power: we will see for instance that an infinite structure 
is never determined, up to isomorphism, by its first-order theory (Corol- 
lary B.2.4). In fact this apparent weakness is a strength and a reason for 
its efficiency. As we will see it is sometimes quite useful to be able to 
switch from one model of a theory to another. 


One of the main themes of this chapter is quantifier elimination, 
which in concrete examples often yields a particularly simple descrip- 
tion of all definable sets in models of the theory under consideration. In 
Theorem we provide a very useful semantical criterion for a for- 
mula to be equivalent to a quantifier-free formula in a given theory (a 
syntactic condition). This applies in particular to the theory of alge- 
braically closed fields which is studied in and other examples are 
considered in exercises. As a remarkable consequence of this study, we 
provide a statement and a proof of the Lefschetz principle from algebraic 
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geometry (Theorem which allows us to transfer results from alge- 
braically closed fields of characteristic zero to algebraically closed fields 
of sufficiently large characteristic. A beautiful application is provided 
by Ax’s Theorem (Theorem stating that injective complex polyno- 
mial mappings are always surjective, which is proved by reduction to 
finite fields. 


3.1. Some Fundamental Theorems 


Theorem 3.1.1(Compactness Theorem). Let T be a theory such that any 
finite subset of T has a model. Then T has a model. 


Proof. The theory T is consistent if and only if any finite subset of T is 
consistent. Thus the statement follows from Theorem 


Remark. Our proof of the Compactness Theorem uses Gédel’s Com- 
pleteness Theorem. For a more direct semantical proof, see Exercise3.7.5. 


Exercise. Fix a language £ and denote by X the set of £-theories T 
which are complete and closed under deduction. One may define a topol- 
ogy on. X in the following way. For g an £-sentence, one sets(g) :={T € 
X | g € T}, and one considers the topology generated by the family of 
all (gp). Prove: 


(1) (PA yp) = (p) N (YY) and (Ag) = X \ (y). In particular the sets 
(y) form a basis of clopen subsets. 


(2) The space X is compact and Hausdorff. 


Definition. Let IN and MN be L-structures. 


(1) We say that M and M are elementarily equivalent if they satisfy 
the same £-sentences, that is, if Th(M) = Th(). This is de- 
noted by M = MN. 


(2) Assume that IN C M. We say that I is an elementary substruc- 
ture of % (and MN is called an elementary extension of IM) if for 
every £-formula 9(xj,...,X,) and every tuple a = (qj,...,a,) € 
M" we have M F gla] if and only if N F y[a]. This is denoted 
by M < MR. 
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Remark. 
(1) IfM KX MR, then M = MN. 
(2) FMAM, then M = MN. 


The following example shows that if M@ C Mand M@ = Nit is not 
always the case that IM < MN. 


Example. One considers the two £,,q-structures M := (N;<)and M := 
(N*;<). One has M C M and M & MN, in particular Mm = N. On the other 
hand the extension IN C MN is not elementary, since IM F ~Ax x < 1 and 
ME Axx <1. 


Theorem 3.1.2 (Tarski-Vaught Test). Let M and MN be L-structures with 
M CN. Assume that for any L-formula ¢(Xo,...,X,) and any tuple a = 
(ay,...,a,) € M", if there exists b) € N such that NF gy[bo, al, then there 
exists dg © M such that N F g[dg, a]. Then M < N. 


Proof. By induction on ht(g(x,, ..., X,,)) one proves that for any a € M” 
one has MF gla] —> NE gla]. 

When ¢ is atomic, this is clear since M is a substructure of #. The 
case of logical connectives is also clear. 


Assume now that 9(x),...,X,) is equal to Axqtb(Xp,..., Xn). (AS Xo 
is not free in y, we may assume that x9 # x; fori = 1,...,n.) Leta = 
(Q,,...,@,) € M". Then 

ME pla] <> there exists ag € M such that M F p[ap, a] 
<=> there exists ag € M such that N F ~[dpo, a] 
<=> there exists b) € N such that N F w[bo, a] 
<= NF lal, 
where the second equivalence follows from the induction hypothesis 
and the third one from the hypothesis of the theorem. 


Notation. For a language £ we set card(£) = card(Fml’). 


Note that card(£) > card(7*). 


Theorem 3.1.3 (Downward Lowenheim-Skolem Theorem). Let M be 
an £-structure and A C M. Assume that card(M) > card(L). Then there 
exists an elementary substructure M, of M containing A which is of cardi- 
nality sup(card(A), card(Z£)). 
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Proof. Up to enlarging the subset A if necessary, we may assume that 
card(A) > card(Z). 

In this proof, when @ # B C M, we denote by BC M the base set of 
(B), the substructure generated by B. By Exercise 2.3.3] we have 


BH (t™ [bins dy) | t= t0eiynsk) SF, Disacy dy © BY. 


In particular, if card(B) > card(£), then card(B) = card(B), since there 
exists a surjection from J“ x U,, ex B” onto B. 


Let % := (A) and Ag := A. Assuming A; C M is already defined, 
one constructs A;,, as follows. For any £-formula 9(Xpo,...,x,,) and any 
n-tuple a € A}, if M F vb, a] for some by € M, one chooses c(¢g, a) € 
M such that M F gy[c(y, a), a]. One sets 

_ L 
Bi, t= A; U {c(y, a) | O(Xo,...,X,) € Fml™, ay,...,a, € Aj} 
and A;,, := B41, the base set of %j4, = (Biz1)m- 

Let My := U; ey4i- Then Mo (is non-empty and) contains c™ for 
any c € @“ and is closed under f™ for anyf € F%. It is thus the base set 
of a substructure NM, of M. 

Let ¢(Xo,..-,X,) be a formula, a € Mj and by € M such that M F 
glbp,a]. There exists N € N such that ay,...,a, € Ay. By the con- 
struction of Ay4, there exists cg € Ay+1 © Mo such that M FE ¢ylco, a]. 


Hence Mo < Mi by the Tarski-Vaught Test. It is clear that card(My) = 
card(A). 


Example 3.1.4. Let R = (R;0,1,+,—,-,<) be the ordered field of real 
numbers. 


(1) There exists R’ = ® with R’ non-archimedean, that is, con- 
taining an element ¢ > Osuch thatn-¢ =¢+...+¢ <1 for any 
a a7 


n times 
n EN. Such an ¢ is called infinitesimal. 


(2) There exists ®’ < ® such that R’ is not complete. 


Proof. (1) Let c be a new constant symbol and £ := Lo ying U {c}. Let 
Qn be the formula c + ...+.¢ < 1. One considers the £-theory 
‘<a ea ee 


n times 


T := Th(R) uU{0 <chu{y, | n © N}. 
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For any finite Ty C T there exists N € N such that g,, ¢ Tp for any 
m > N. The expansion of ® to £ obtained by interpreting c by 1/N ER 
is thus a model of Ty. By compactness, it follows that T has a model R’. 
The element ¢ = c®’ is infinitesimal, hence R’ is non-archimedean. 

(2) By the Downward Lowenheim-Skolem Theorem (for A = 9), 
there exists R’ < ® with a countable base set R’. Since ®’ is a field, 
@ C R’. Taker € R\ R’. Then the set {r’ € R’ | r’ < r} has no 
supremum in R’, since R’ is dense in R. 


Remark. One may prove that (Raig: +,—,0,1,-,<) < ®, where Raig = 
{r € R | there exists 0 4 p(X) € Q[X] such that p(r) = 0}. 


Remark. Suppose that NW < M’ and D C M” is a definable set (with 
parameters). Then D extends canonically to a definable set D’ C M"" in 
§N’, such that D’ N M" = D. Indeed, if D = g[M, b] for some formula 
(x, y) and tuple b from ®, then D’ = g[M’, b] has the desired property, 
and it is independent of the choice of g and b. 


Definition. Let P be a property that an £-structure may or may not 
satisfy. We say that P is axiomatizable (resp. finitely axiomatizable) if 
there exists an £-theory T (resp. an £-sentence gy) such that for any 
£-structure M, the property P is satisfied by MM if and only if M@ F T 
(resp. I F gv). In this case one says that T (resp. y) axiomatizes the prop- 
erty P. 


Proposition 3.1.5. Let P be a property that an £-structure may or may 
not satisfy. Then P is finitely axiomatizable if and only if P and its negation 
are both axiomatizable. 


Proof. If gaxiomatizes P, then = axiomatizes its negation. Conversely, 
assume that T axiomatizes P and that T’ axiomatizes the negation of 
P. The theory T U T’ being inconsistent, there exist 9),...,¢, € T, 
Q1>-->Pm € T’ such that {9),..., Pn, P1,---» Pm} is inconsistent. Hence 


n m 
meETome \o ome \g>MET' > MET, 


i=1 i=1 


showing that the sentence ee g; is a finite axiomatization of P. 


Example 3.1.6. 
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(1) For any prime number p, fields of characteristic p are finitely 
axiomatizable in the language L, ing. 


(2) Fields of characteristic 0 are axiomatizable in L,ijng, but are 
not finitely axiomatizable. Indeed, if a sentence gp) were a fi- 
nite axiomatization, it would be a consequence of the theory 
Prieia U{T1+..+1=0 | p prime}, and then 

p times 
Prields U{71+...+1=0 | pprime and p < N}F Gp 

p times 
forsomeN €N. This would lead to a contradiction, since there 
exist fields of characteristic p > N. 


(3) Archimedean ordered fields are not axiomatizable (cf. Exam- 
ple B.1.4(1)). 
(4) Complete ordered fields are not axiomatizable (cf. Exam- 


ple B.1.4(2)). 


The last two examples make more precise what we said at the be- 
ginning of Chapter 2 


3.2. The Diagram Method 


In practice, one often wishes to construct a model of some theory which 
contains a given structure as a substructure or as an elementary sub- 
structure. In this section, we will see that these properties may be en- 
coded in suitable theories. 


Let IN be an L-structure. We consider the language Ly, obtained by 
adding to £ a new constant symbol c,, for each element m of M. The 
structure I admits a natural expansion to the language Ly, by inter- 
preting c,, by m. We denote M* the corresponding £y,-structure. 


By abuse of notation, we shall sometimes identify m and c,, in what 
follows. This will not cause any trouble, because of the Substitution 


Lemma (Proposition 2.4.2). 
Definition. 


(1) The complete diagram of MN, denoted by D(M), is defined as 
Th(M*). It is the set of Lyy-sentences P(Cm,,--»Cm,,)» Where 
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9(X1,..,X,) is an L-formula and m € M"” is such that M F 
g[m,..., My]. 

(2) The simple diagram of M, denoted by A(M), consists of all Lyy- 
sentences 9(Cy,,---» Cm, )» Where 9(X},..., X,) isa quantifier-free 
£-formula and m € M” isa tuple such that MF g[m,..., my]. 


Proposition 3.2.1. Reducts to the language £ of models of D(M) corre- 
spond, up to £-isomorphism, to elementary extensions of M. 


Proof. Let %’ F D(M) and N := MN’ ¢ be its L-reduct. The map 
mb ch provides an isomorphism between § and an elementary sub- 
structure of 9. 


Conversely, if M < M, we consider the £,,-expansion MN’ of M ob- 
tained by interpreting c,, by m. We will prove that 9’ F D(M). For 
this, let P(CnisaaCm,) € D(M). By definition, this means that Mm F 
gl[m,..,My,], so N F gvlmy,...,m,] by elementarity, and finally %’ F 
P(Cm,> +> Cm,,) by the Substitution Lemma. 


Proposition 3.2.2. Reducts to the language £ of models of A(M) corre- 
spond, up to £-isomorphism, to extensions of M. 


Proof. Similar to the previous proof. 


Theorem 3.2.3 (Upward L6wenheim-Skolem Theorem). Let §% be an 
infinite L-structure for some language £, and let x be a cardinal such that 
x > sup(card(M), card(£)). Then there exists an elementary extension of 
§M which is of cardinality x. 


Proof. It is enough to construct an elementary extension 9 of N of car- 
dinality > x, and to apply the Downward L6wenheim-Skolem Theorem 
to a subset A C N of cardinality x containing M. 

For any i € x, let c; be a new constant symbol. Consider the lan- 
guage £ := Ly U {c; | i € x} and the £-theory 


T :=D(M)U {re =G |i <j < x}. 
Then T is finitely satisfiable. Indeed, if Ty is a finite subset of T, inter- 


preting the new constant symbols ¢j,,..., ¢;,, occurring in To by distinct 
elements of M one gets an expansion of §N* which is a model of Ty. (This 
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is possible since M is infinite.) By compactness, one gets a model % - T 
whose £-reduct % is an elementary extension of % (by Proposition B.2.1)) 
of cardinality > x, since it contains the elements ree for i € x, which are 
pairwise distinct. 


Corollary 3.2.4. Let T be a theory having an infinite model. Then T has 
a model of cardinality x for any x > card(£). 


Exercise 3.2.5. Let £ be a finite language (that is, a language the signa- 
ture of which is finite), and let M@ be a finite L-structure. 
(1) Prove that there exists an £yy-sentence g € A(M) such that 
+ @ > g’ for any g’ € A(M). 
(2) Deduce that if MN = M, then N Y M. 


(3) (More difficult.) Let §N’ be a finite £’-structure, with L’ not 
necessarily finite. Prove that N’ = M' > MN’ = M’. 


3.3. Expansions by Definition 


It is often useful to enrich the language by symbols for definable rela- 
tions, functions or constants. We will describe in this section how this 
may be done formally. 


Notation. 4! xp (“there exists a unique x such that 7”) is an abbrevia- 
tion for Ix( AVX’ (Qy1/~ 3 X = x’)), with x’ a variable which is distinct 
from x. 


Definition. Let T be an £-theory and let £’ D> £. Assume that there 
are 
+ for any n-ary relation symbol R € £’ \ £ an L-formula 
PRX Xn)s 
+ for any n-ary function symbol f € £’ \ £ an L-formula 
PF(Xo,X1,++,Xp) such that T F Vx1,..., Xn! Xo Pf, 
* for any constant symbol c € £’ \ £ an £-formula ¢,(x9) such 
that T F A! Xo Gc. 
Then the £’-theory T’ given by T together with 


* an axiom VX),...,X;,(@p(X1,..-,X,) © RX, +++ X,) for any rela- 
tion symbol R € £’ \ £, 
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+ anaxiom VxX),...,Xn Pr(f(%1,--» Xn), X15 ++» Xn) for any function 
symbol f € £’\ £, 
* an axiom ¢,(c) for any constant symbolc € £’ \ £ 


is called an expansion by definition of T. 


Recall that two formulas ¢,(x),...,x,) and 9(x,,...,X,) are called 
equivalent in the theory T if T F Vxy,...,X,(@; © $2), and that they are 
called logically equivalent if they are equivalent in the empty theory (cf. 
Exercise 2.7.5). 


Lemma 3.3.1. Any formula is logically equivalent to a formula with all 
terms of height < 1. 


Proof. For gy a formula, let M(¢) be the maximal height of a term in 9. 
The proof will be by induction on (M(¢), ht(¢)) with lexicographic order. 
The only non-trivial case is that of an atomic formula 9, say of the form 
Rt, --- ty. To simplify notations, we assume that ht(t;) > 1fori=1,..,m 
and ht(t;) < 1 fori > m. Fori = 1,...,m, let t; = fi(sj1,-..,5;,,). One 
chooses new variables y; ;, fori = 1,..,mand1 < j < kj. Ify, ‘= 
(i,1> +> Vik)» then ¢ is logically equivalent to the formula ~ given by 


wy, oe ee [xe nae SmOmtm+1 sty A Nyii = 7) 

ij 
and which satisfies M() < M(@). The case where ¢ is given by t; = ft, 
is similar. 


Definition. Let T be an £-theory and T’ 3 T an L'-theory for some 
language £’ D L. One says that T’ is a conservative expansion of T if for 
any £-sentence w one has T F p if and only if T’ F ». 


Proposition 3.3.2. Let T’ be an expansion by definition of T. Then the ex- 
pansion T' D T is conservative. Furthermore, any £'-formula jp’ is equiv- 
alent in T’ toan £-formula wp. 


Proof. Any model ® of T has an L’-expansion (in fact unique) to a 
model §’ of T’. For a relation symbol R € L’ \ £, we may and need to 
define R™’ = gp[M] (cf. Exercise 2.7.4(1)). For a new function symbol 
f, this amounts to defining f” (ay,..., dy) = dp if ME P flo, Q1,--, An]. 
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The case of a new constant symbol is similar. It follows that T’ D T is 
conservative. 


In order to show that any £’-formula is equivalent in T’ to an L- 
formula, for simplicity we treat the case where L' = £ U{f} for an n-ary 
function symbol f. The general case is proved in a similar way. 


First let ~’ be an atomic £’-formula containing only terms of height 
< 1, say p’ equals Rt, --- t,,. Assume for simplicity that fori = 1,...,k one 
has t; = f(si,..., s!,) and that f does not appear in ¢; fori > k. The terms 
sj have height 0, and thus are £-terms. Let Z),..., 2; be new variables. 
Then the £-formula 


k 
AZ, Zk ([e “+ Zita ycttm A \\ Pp lZi, 81, 6] 
i=1 


is equivalent in T’ to y’. If p’ equals t, = t,, the argument is the same. 


Now let ?’ be an arbitrary £’-formula. By Lemma we may as- 
sume that all terms occurring in py’ have height at most 1. For any atomic 
subformula 7’ of =’, one chooses an equivalent £-formula v. (We just 
constructed such a formula.) One then defines p as the £-formula ob- 
tained by replacing any atomic subformula 7’ of p’ by the corresponding 
£-formula y. It is a general fact that replacing subformulas by equiva- 
lent ones produces an equivalent formula. (This is proved in Exercise 
where the notion of a subformula is also defined). 


Example 3.3.3. 


(1) Let T’ = Th(R) be the theory of the ordered field of real num- 
bers and T the theory of the (pure) field of real numbers. Since 
in R we have rr < s if and only if there exists t 4 0 such that 
t2 =s5-—r,T’ isan expansion by definition of T. 

(2) Let T be the £,,q-theory of total orders without endpoints, and 
let T’ be the Ly ;ing-theory of ordered fields. Then T’ is a non- 
conservative expansion of T, since for example T’ F Vx, y(x < 
y > Az(x < zAz < y)), that is, the order in any ordered field is 
dense (as any bounded interval has a midpoint), whereas this 
is not a logical consequence of T. 


(3) Let T = Th((N,;<)). Then w is a definable constant in T, that 
is, there is a formula g(x,) such that T F A! xg and (N;<) F 
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y[w]. Indeed, the set of limit ordinals is defined by the formula 
dyy <xAVydazyy<x>y<ZAz<x) 


which we denote by Lim(*), and it suffices to set p(x) equal 
to the formula Lim(x9) A Vy(Lim(y) > 7y < XQ). 


Exercise 3.3.4. Let T ¢ T’ be theories in languages £ and £’ D ZL, 
respectively. Prove that T’ is equivalent to an expansion by definition of 
T if and only if every model of T admits a unique expansion to a model 
of T’. 


3.4. Quantifier Elimination 


Using induction on height, it is easy to check that every formula ¢ is 
logically equivalent to a formula ~ which is in prenex form, that is, of 
the form Q,x; ---Q,x,7, with y quantifier-free and Q; € {5, V} for all i. 
In general, the number of alternations of quantifiers 4 and V provides 
a good indication about the complexity of the formula ¢ (and of the set 
defined by ¢ in a given structure). In concrete examples, sets defined by 
a quantifier-free formula are usually easier to handle than those defined 
by an arbitrary formula. For this reason, quantifier elimination results 
are very useful and often open the way for a deeper understanding of the 
theory under consideration. 


We shall start with the following important technical result. 


Theorem 3.4.1. Let T be an £-theory, n > 1a natural number and 


9(X1,..,X,) an L-formula. The following properties are equivalent: 


(1) There is a quantifier-free L-formula p(X}, ..., X,) such that p and 
y are equivalent in T. 


(2) Let M and N be models of T and let XA be a common substructure 
of Mand N. Then for any a € A” one has MF gla] —> NE 


glal. 


Remark 3.4.2. When n = 0 and ¢ is a sentence, one may consider ¢ as 
g(x) and apply the theorem to g(x) to find a quantifier-free formula (x) 
which is equivalent to g(x) in T. For instance, the theorem Ayy = y of 
T is equivalent in T to the formula x = x. 
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Note that if the language £ does not contain a constant symbol, 
there is no quantifier-free £-sentence. In this case, when we assert the 
existence of a quantifier-free formula i equivalent to a sentence g in what 
follows, we will allow that 7 has one free variable. 


Proof of Theorem 8.4.1 (1)=>(2). We first observe that if & C 8, 
p(x, ...,X,) is a quantifier-free formula and a € A”, then one has & F 
pla] —> BFE x[a]. Thus, if M et N are models of T having 2 as a com- 
mon substructure and if p(x, ...,x,) is equivalent in T to the quantifier- 
free formula (xj, ...,X,), then for a € A” one has 


ME pla] > ME Ya] — AF yY[a] 
—> Nepal] — NE gfal. 
(2)=>(1). We consider the set of formulas 
T(x) := {y(x},...,X,) quantifier-free | T F Vxy,...,x,(~ > y)}. 


We choose new pairwise distinct constants cj,...,c,, and we consider 
the theory ['(c) := {7(c,-.-,Cn) | ¥ € T'(x)} in the augmented language 
Li'=LU{cy,...,C,}. We now prove that 


(*) TUTE G0). 
If (4) did not hold, one could find Mm’ F TUT(C) U {=g(e)}. Let W’ := 
(ch, ..., C2’), = (Ai...) be the substructure generated by the elements 


id in M¢’. Observe that ['(c) C A(2’). Let us prove that 
x := TVA) vu {gO} 
has a model. 


Otherwise, T U A(2’) F =(c). Since any element of A can be writ- 
ten as an £’-term, if one denotes by A-(2’) the set of quantifier-free 
£'-sentences in A(2’), then T U A(2M’) is a conservative expansion of 
T UAX2’) by Proposition In particular, 

TU AA) F =G(C). 


Hence there exist quantifier-free £-formulas & (x), ..., &(x) such that 


k k 
TEA E@>7—9@) and = AQ’) F \ &@ =: EO. 


i=1 i=1 


Licensed to Mathematisches Forschungsinstitut. Prepared on Thu Sep 30 04:18:29 EDT 2021for download from IP 188.1.230.74. 
License or copyright restrictions may apply to redistribution; see https://www.ams.org/publications/ebooks/terms 


3.4. Quantifier Elimination 77 


Since the constant symbols c; do not appear in T, or in g(x) or &(x), one 
deduces (for instance by Lemma that 

TF Wx(§(x) > (x) 
and then T F Vx(~(x) - 7&(x)). By definition it follows that ~&(x) € 


T(x) and 7&(c) € I'(c), hence 7&(c) € A(2’), which provides a contra- 
diction. 


Hence & has a model ¢*, and the L-reduct MN of M* contains an iso- 
morphic copy 8’ of 2’ as a substructure by Proposition Up to iden- 
tifying B’ and 2’, we have constructed two models M@ = M’ | , and N 
of T containing a common substructure & = 2’ | - such that, if one sets 
aj = cae then 2 F y[a] and M - a~y[a], which contradicts (2). We have 
thus proved (fi). 


By compactness there exist ¢)(C), ..., G,(¢) € I'(c) such that 


m 
TEA&@ > ¢©. 
i=1 
As above, this implies that T F Vx ( ie G(x) = g(x). Since for all i 
we have T F Vx(9 — ¢;), we infer that T F Vx (As G(x) g(x), with 


We ¢(x) a quantifier-free £-formula. 


Definition. Let T be an £-theory. One says that T admits quantifier 
elimination (in the language £) if every £-formula ¢ is equivalent in T 
to a quantifier-free £-formula. 


Lemma 3.4.3. Assume that for every quantifier-free formula ¢ and any 
variable x there exists a quantifier-free formula tp such that 4x and yp are 
equivalent in T. Then T admits quantifier elimination. 


Proof. Let 7 and 7)’ be two formulas which are equivalent in T, which 
we denote by ~ ~7 #’. Since =p ~7 7’, Ax ~7 Axy’ and yAY ~7 
x Aw’ for any formula y, we can argue by induction on the height of 
the formula, and the statement follows, by considering only formulas in 
prenex form and eliminating one quantifier at the time. 


Theorem 3.4.4. Let T be an £-theory. One assumes that for any pair of 
models M and NK of T, for any common substructure A of M and N and for 
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any quanitifier-free formula p(X, ..., Xp), if there exista € A” and bh) € M 
such that MF g[bo, al, then there exists cg € N such that NF g[co, al. 


Then T admits quantifier elimination. 


Remark. The converse of this statement is clear: any theory which ad- 
mits quantifier elimination satisfies the hypothesis of the theorem. 


Proof of Theorem Let & CM, N be given, with M, NF T. Let ge 
be a quantifier-free formula and let v be 4x9. By hypothesis, we have 
ME y[a] —> NE y[a] forevery a € A”. It follows from TheoremB.4.1| 
that v is equivalent in T to a quantifier-free formula, which is enough to 
conclude by LemmaB.4.3 


Proposition 3.4.5. Let T be a theory which admits quantifier elimination. 
(1) Let M and N be models of T with a common substructure. Then 
m= N. 
(2) Let M and N be models of T. If M C MN, then M KN. 


Proof. (1) This isa special case of the easy implication in TheoremB.4.1} 
Indeed, any sentence ¢ is equivalent in T to a quantifier-free formula 
w(x). Let 2 be a common substructure of IN and NM. For any a € A, one 
then has 


MED —> MEY[a] > AF Y[a] 
—> NE ya] ] NE yg. 


(2) This is a direct consequence of Theorem 8.4.1]. 


3.5. Algebraically Closed Fields 


In this section we treat an important and particularly nice example of 
a theory from algebra, namely the £,jng-theory ACF of algebraically 
closed fields (see Example 2.6.3). 


Let A bea subring ofa field K. An element of K is said to be algebraic 
over A if it is the root of a non-zero polynomial with coefficients in A. IfA 
is an integral domain, an algebraic closure of A is an algebraically closed 
field K containing A such that any element of K is algebraic over A. 
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In the following fact we list some results from field theory which we 
will need in this and the next section. 


Fact 3.5.1. Let A be an integral domain. 


(1) There exists an algebraic closure of A. 

(2) If K and K' are algebraic closures of A, then there exists an iso- 
morphism f : K = K’ such that f | 4 = idg. 

(3) Assume A is a subring of an algebraically closed field L. Then 
the subfield ate = {b EL | bis algebraic over A} is an algebraic 
closure of A. 

(4) Let Fes be an algebraic closure of the field with p elements Fp. 
Then Fes is an increasing union of finite subfields Fy, N € N. 
More precisely, for any integer k > 1, the set of roots of the poly- 
nomial XP" —X in Foe is a subfield F yx (with p* elements), and 
Weary Fok = Fae, Since furthermore F yk © Fp when k | 1, it is 
enough to take Fy := Fpm. 

(5) Any algebraically closed field is infinite. 

(6) Let K C L bea field extension with K an algebraically closed 
field, and let b € L \ K. Then b is not algebraic over K. 


Theorem 3.5.2 (Chevalley-Tarski). The theory ACF admits quantifier 
elimination. 


Proof. First observe that a substructure ofa field in £;ing is nothing but 
a subring. By TheoremB.4.4it is thus enough to prove that if K and L are 
algebraically closed fields, A is a common subring, and ¢(Xpo,...,X,) isa 
quantifier-free formula, then for any a € A”, if there exists b € L such 
that L F g[b, a] then there exists c € K such that K F gl[c, a]. 


By FactB.5.1] K and L contain algebraic closures Fx and F, of A that 
are isomorphic via an isomorphism inducing the identity on A. Enlarg- 
ing A if necessary, we may thus assume that A is an algebraically closed 
field and even that A = K CL. 

The formula ¢ is logically equivalent to a formula of the form 
Ne Aj Xi,j, With each 7;,;(Xo,...,Xn) either atomic or the negation 
of an atomic formula. If L EF g[b,a], there exists i such that 
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Le p%ij [b, a]. It is thus enough to consider the case where ¢ is a 
conjunction of atomic formulas and negations of atomic formulas. In 
the theory of fields, any atomic formula is equivalent to P(x) = 0 for 
some polynomial P(x) with integer coefficients. We may therefore as- 
sume that 9(x) is of the form 


n m 
/\8@) =0A A -Q@) = 0. 
i=1 i=1 
If one of the B(x9, ay, ..., An) € K[Xq] is a non-zero polynomial, then b is 
algebraic over K, which implies that b € K and we are done. 


Thus we may assume that ¢ equals com 3Q;(x) = 0. By the exis- 
tence of b, each polynomial Q;(x9,a) € K[xo] is non-zero, and hence 
has only a finite number of roots. The field K is infinite, since it is alge- 
braically closed, so there exists c € K such that K F ¢[c, a]. 


Corollary 3.5.3. In K — ACF, the definable sets (with parameters) are 
precisely the constructible sets, that is, sets given by boolean combinations 
of polynomial equations with coefficients from K. 


Let p be a prime number or p = 0. We denote by ACF, the theory 
of algebraically closed fields of characteristic p. 


Theorem 3.5.4. Let p be a prime number or p = 0. The theory ACF, is 
complete. 


Proof. Any field of characteristic p > 0 contains F, as a subfield. If 
K and L are algebraically closed fields of characteristic p, then K = L 
by Theorem and Proposition B.4.5(1), which proves that ACF, is 
complete. 


For ACF, the argument is the same, replacing F,, by Q. 


Theorem 3.5.5 (Lefschetz Principle). Let p be an L,ing-sentence. The 
following conditions are equivalent: 
(1) CF@. 
(2) There exists an algebraically closed field of characteristic 0 in 
which ¢ is satisfied. 
(3) Any algebraically closed field of characteristic 0 satisfies gp. 
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(4) There exists N € N such that ¢ is satisfied in any algebraically 
closed field of characteristic p > N. 


(5) There exists an infinite set of prime numbers P such that for any 
p € P there exists an algebraically closed field of characteristic 
pinwhich ¢ is satisfied. 


Proof. (1) <=> (2) <> (3) follows from Theorem B.5.4. 


(3)=(4). Note that ACF is equal to ACF U {7p | p prime}, where 
Xp expresses that p = 1+--- + 1 is different from 0. If ACF F 9, by 
compactness there exists a finite subset A of ACF such that A F g. But 
A contains only a finite set of sentences Xp: Thus, there exists N € N such 
that K F A for any algebraically closed field K of characteristic p > N. 
For such a field K, one has K F 9. 


(4)=>(5) is clear. 
(5)=>(3). For p € P, let K, F ACF, such that K, F 9. If ACFp F 9, 
then ACF, F 7-9 by completeness. By the implication (3)>(4), there 
exists N € N such that 7¢ is satisfied in any algebraically closed field of 
characteristic p > N. This forces P to be finite, a contradiction. 


Theorem 3.5.6 (Hilbert’s Nullstellensatz). Let K be an algebraically 
closed field and P,(x),..., P(x) € K[x},...,X,]. If the system of polyno- 
mial equations P(x) = B(x) = ... = P,(x) = 0 has a solution in some 
field L 2 K, then it already has a solution in K. 


Proof. Let L > K anda € L” be such that R(a) = ... = P,(a) = 0. 
Up to enlarging L if necessary, we may assume that L is algebraically 
closed. Since ACF admits quantifier elimination, we have K < L by 
Proposition B.4.5. 

We now choose £,jng-terms F(x, Z;) and tuples b; in K such that 
B(x) = K(x, b;). Then L F ax \ FG, b;) = 0, and therefore K F ax /|\ 


F(x, bj) = 0, since K < L. 


3.6. Ax’s Theorem 


A chain of £-structures is a sequence (M;)jexy of £-structures such that 
M; CM; ,, for any i. 
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If (M;)iexy is such a chain, there exists a unique £-structure M with 
base set M = OF ex Mi such that ; C M for any i. Indeed, the only way 
to interpret the language symbols is to set c™ = c™o, f™ = LJ... f™ 
and R™ = gree R™i, which is clearly well defined. The £-structure M 
obtained that way is denoted by Up en Mie 


Definition. A formula of the form Vx, ...,x,4y,,..-, Ym, with g quan- 
tifier-free and m,n > 0, is called a V4-formula. 


Lemma 3.6.1 (Preservation of Va-sentences under unions of chains). 
Let p be a V3-sentence in £ and (M;) jen a chain of £L-structures such that 
Mm; F p for any i. Then M = LU, Mi Fy. 

Proof. Let ~ be the sentence Vx),..., X,Ay1,-->¥n0(% y) with g quan- 
tifier-free. We have to prove that MF Ay,,...,¥n¢la y] for any a € M". 
Since the sequence of base sets (M;)jen is increasing, there exists k € N 
such that a € My. Hence there exist bj,...,Dj, € M, such that M, F 
g|a, b], as M, F w. One deduces that M F ya, b], since ¢— is quantifier- 
free and MM, is a substructure of M. 


Remark. It will be proved in Exercise B.7.1]]that a sentence is preserved 
under unions of chains if and only if it is logically equivalent to a VA- 
sentence. 


Proposition 3.6.2. Let p be a VA-sentence in the language Ling which is 
satisfied in every finite field. Then ACF F 9. In particular 9 is satisfied in 
Cc. 


Proof. As recalled in Fact B.5.1(4), Fes is the union of a chain of finite 


fields. So it follows from Lemma that one has Fe’ & g for every 
prime p. The statement is now a consequence of the Lefschetz Principle. 


Theorem 3.6.3 (Ax’s Theorem). Let f : C” — C" bea polynomial 
mapping, that is, of the form f = (f,,..., fn) with f; © C[x1,...,X,] poly- 
nomials. If f is injective, then f is surjective. 


Proof. There exist £;j,g-terms — which can be interpreted as polyno- 
mials with integer coefficients — Ba(Z,X) such that, for any field K, 
any polynomial g(x) € K[x,,...,x,] of degree < d may be written as 


Licensed to Mathematisches Forschungsinstitut. Prepared on Thu Sep 30 04:18:29 EDT 2021 for download from IP 188.1.230.74. 
License or copyright restrictions may apply to redistribution; see https://www.ams.org/publications/ebooks/terms 


3.7. Exercises 83 


P,,a(a, x) for some tuple a of elements of K. The following sentence 7p, q 
is V5 and expresses that any injective polynomial function f : K"” > K”, 
with all polynomials f; of degree at most d, is surjective: 


VZ1, sea an ,u man¥ ((Aa, a(Zis x)= = u) 
v(Aa 2,d(Zir®) = PaZir® A> \ x = “i 


i=1 i=1 


Since p,q is satisfied in every finite field, it follows from Proposi- 
tion B.6.2] that ACF F ?q_,, so in particular C F pq p. 


3.7. Exercises 


Exercise 3.7.1. Let I, and I, be two L-structures. Prove that M, = 
IM, if and only there exist N,, NR, and $B such that M; = MN; and N; < P 
fori = 1,2. 


Exercise 3.7.2. Let T = Th(%), where 9% denotes the £,,-structure 

(N;S,0,+,:,<). Prove that there exists %’ F T such that N’ contains 

a non-standard prime number, that is, an element p’ such that one has 

ME Vx(Gyx-y= p' > (x =1Vx= p’))and M EF (S--- $0) < p’ for 
n times 


anyneN. 


Exercise 3.7.3. Prove that any theory T admits an expansion by defini- 
tion that admits quantifier elimination. 


Exercise 3.7.4 (Vaught’s Criterion). Let £ be a first-order language and 
x an infinite cardinal. An £-theory T is said to be x-categorical if all its 
models of cardinality x are isomorphic. 


We assume that x is larger than or equal to the cardinality of £. 
(1) Let T be an £-theory. Prove that any infinite model of T is 
elementarily equivalent to a model of T of cardinality x. 


(2) Prove that if a consistent theory T is x-categorical and has no 
finite model then it is complete. 
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Exercise 3.7.5. Let £ bea first-order language, and let I be a non-empty 
set. Let (M;)j-; be a family of £-structures. Finally, let U be an ultrafilter 
on I. (We refer to Exercise for the notion of an ultrafilter.) If M; is 
the underlying set of §;, we consider the following equivalence relation 
on |], Mi: (ai) =u (bj) if and only if {i € I | a; = bj} € U. We denote 
by My the set of equivalence classes and by z : IL, M; > My the 
canonical projection. 


(1) Prove that =, is indeed an equivalence relation. 


(2) Prove that My admits a unique £-structure Ny such that 
for every atomic £-formula g(x, +++, x!) and every tuple 
(a',---,a") € (JJ, Mi)” one has 


My F g{z(a'), rg m(a")] 


if and only if the set of i such that MN; F aj, -+,a?] isin U. 
The structure My is called the ultraproduct of the MN; with re- 
spect to U. It is denoted by [],, Mj. 


(3) Prove Los’s Theorem: For every formula g(x!, --- x") and every 
tuple (a’,---,a") € ([], Mi)”, one has 


My F glz(a'), a m(a")] 


if and only if {iE I | M; F ylaj,---,a"]} € U. 

(4) Use Los’s Theorem to give a direct proof of the Compactness 
Theorem (Theorem B.1.1]). 
[Hint: For a theory T, let I be the set of all finite subsets of T. 
Show that there is an ultrafilter U on J which contains Ay, := 
{T, € I | T, D To} for every Ty € I. Prove that if Mp, F To for 
every Ty, then [],, Mr, is a model of T.] 


(5) Let P be the set of prime numbers, and let U be a non-principal 
ultrafilter on P. We work in £;;ng, and we consider the family 
(Fp)pep Of fields with p elements and their ultraproduct Fy 
with respect to U. Observe that Fy is a field. Determine its 
characteristic. 

(6) Let (M;)iey be a family of infinite well-orderings, considered 
in Lo,g. Let U be a non-principal ultrafilter on N, and let My 
be the ultraproduct of the %; with respect to U. 
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Prove that there is a strictly decreasing sequence in My of 
length &,. In particular, My is not a well-ordering. 


Exercise 3.7.6. 


(1) We denote by DLO the £,,g-theory of dense total orderings 
without endpoints. 

(a) Let M, M’ be models of DLO and let A C M, A’ CM’ be 
finite substructures. Assume that f : & & 2’ isan Lopq- 
isomorphism and b € M. Prove that f can be extended to 
an isomorphism with domain AU{b} and image contained 
in M’. 

(b) Deduce that DLO admits quantifier elimination and is com- 
plete. 

(c) Prove that DLO is X,-categorical. 

(d) (More difficult.) Prove that for every cardinal x > No, the 
theory DLO is not x-categorical. 


(2) Let L’ = Log U {c; |i © N}, with {c; | i € N} a set of constant 
symbols. Let DLO’ be the £’-theory obtained by adding to DLO 
the axioms c; < c for any i, j € N withi < j. 
(a) Prove that DLO’ is a complete theory. 
(b) Prove that the theory DLO’ has exactly three countable 
models up to isomorphism. 


Exercise 3.7.7. Let T be the £,,q-theory of total discrete orders with- 
out endpoints, that is, totally ordered sets such that any element has an 
(immediate) predecessor and an (immediate) successor. Observe that 
(Z;<) ET. 


(1) Prove that T does not admit quantifier elimination. 


(2) Write an £,,q-formula which defines the graph of the succes- 
sor function in any model of T. Let T’ be the definitional ex- 
pansion of T in the language L’ = {<, S}, where S corresponds 
to the successor function. Prove that T’ admits quantifier elim- 
ination and is complete. 


(3) Deduce that T is complete. 
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Exercise 3.7.8. Let m #4 n be natural numbers. Prove that there is an 
Lora-sentence which is true in (w - m, <) and false in (w - n, <). Is there 
an £o,q-Ssentence which is true in (w”, <) and false in (w", <)? 


Exercise 3.7.9. In this exercise, we consider the theory of the structure 
Mo := (Q;0,+,<). Recall that an ordered abelian group is an abelian 
group A endowed with a compatible total ordering (a < bimpliesa+c < 
b+c for any a, b,c € A). We shall deal with ordered abelian groups in the 
language Logg := {0,+, <}, and we denote by OAG the corresponding 
theory. 

Let DOAG be the theory of non-trivial divisible ordered abelian 
groups. It is obtained by adding to OAG the following axioms: 


(i) dxx #4 Oand 


(ii) an axiom of the form Vxdy y+ ---+ y = x for any integer n > 
Ne ey 


n times 


1. 
Observe that I F DOAG. 

(1) Prove that if (D;0,+,<) F DOAG, then (D;<) - DLO. (See 
Exercise for the theory DLO.) 

(2) Prove that if (A;0,+,<) EF OAG, then (A;0, +) is torsion-free, 
that is, ifna = 0 fora € Aandn E€ N*, thena=0. 

(3) Let S F DOAG and let & = (A;0,+,<) C D. We denote by 
divs(20) the substructure with base set 


{d€D|4dn>1landaeAsuchthata+n-de A}. 


Prove that if ,9’ F DOAG and & C 9, A’ C 9’, then every 
isomorphism f : 2% = 2’ extends uniquely to an isomorphism 
f 2 dive (Q) & dive (2’). 

(4) Prove that DOAG is complete and admits quantifier elimina- 
tion. 

(5) Deduce that if 5 F DOAG and ¢(x) is any (Loqg)p-formula, 
then the subset of D defined by g(x) is a finite union of open 
intervals and singletons. 


Exercise 3.7.10 (Embedding Theorem). Given languages £’ C £ and 
an £-theory T, we denote by Ty (or by Ty incase £ = £’) the theory of 
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all universal £'-sentences (that is, sentences of the form Vx,,..., x, for 
y quantifier-free) which are consequences of T. 


(1) Prove the Embedding Theorem: An L’-structure M’ embeds 
into the L'-reduct of a model of T if and only if M’ F Ty. 
[Hint: Use the diagram method to establish the non-trivial im- 
plication. | 


(2) Application. A group G is called left-orderable if there exists 
a total order < on G such that for all elements x, y, g from G, 
x < yentailsg-x < g-y. Nowlet Ly, = {e, -,(-)71} be the 
language of groups. 
(a) Prove that the class of left-orderable groups may be axiom- 
atized by a universal £,,-theory. 
(b) Give an explicit (universal) axiomatzation in the case of 
commutative groups. 


(3) (a) Prove the Preservation Theorem for universal theories: T 
may be axiomatzed by universal sentences if and only if it is 
preserved under substructures, that is, any substructure of 
a model of T is a model of T. 

(b) Let T be the theory of fields in £,jng. Determine the mod- 
els of Ty. 


Exercise 3.7.11 (Preservation Theorem for Vi-theories). The aim of this 
exercise is to prove that the following conditions (i) and (ii) are equiva- 
lent for a theory T (Chang-Los-Suszko Theorem): 


(i) T is preserved under unions of chains, that is, if (Mien is a 
chain of £-structures such that M; F T for all i, then J,-., Mi 
T. 


(ii) T may be axiomatzed by VA-sentences. 


ieN 


(1) Observe that (ii)>(i). 
(2) Let M C MN. We say that M@ is 1-elementary in N and we write 
M <, N, if for any existential formula 9(x),...,x,) and anya € 
M" one has M F gylay,...,a,] if and only if N F gylay,..., ay]. 
(The same is then true for universal formulas as well. Why?) 
Prove that if M@ <, ®, then there is Mt’ D MN such that 
M <M’. 
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[Hint: Use the Embedding Theorem from Exercise B.7.10}] 


(3) Let T be a theory which is preserved under unions of chains. 
One sets 


Ty3 = {g | g isa W4-sentence in £ such that T F ¢}. 


Prove that every model of Ty; admits a 1-elementary extension 
which is a model of T. 


(4) Prove that (i)>(i) and conclude. 


(5) Prove that a sentence ¢ is preserved by unions of chains if and 
only if it is logically equivalent to a Va-sentence. 


Exercise 3.7.12 (Indiscernible Sequences). Let % be an £-structure, and 
let (I, <) be an infinite totally ordered set. The Ehrenfeucht-Mostowski 
type of a sequence a = (dj)j¢; in M is given by EM(a) = (EM,,(@))ys1, 
where for each n > 1, EM,,(a) denotes the set of £-formulas g(x, ..., X,) 
with M F g[aj;,,...,a;,] for all i) < --- <i, in I. We call a indiscernible 
if for any £-formula ¢(x1,...,X,), either ¢ € EM,,(a) or 7g € EM, (a), 
that is, if one has 

(*) Me glai,, oe a;,,| if and only if M F gla;,» aay aj, ] 

for alli; <...<i, and j, <... < j, from I. 

Now let VJ, <) be an infinite totally ordered set and b= (bj) jer an 
arbitrary sequence in M. Prove that there exists % > IM containing an in- 
discernible sequence (a; );-; such that EM,,(a) D EM,,(b) for all integers 
n>1. 

[Hint: Use Ramsey’s Theorem (cf. Exercise to prove that for any 
finite set of formulas ®, there exists an infinite subset J’ of J such that the 


equivalence (#4) holds for all finite subtuples of b indexed by increasing 
sequences of elements in J’ and all formulas from ®.] 
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Chapter 4 


Recursive Functions 


Introduction 


In this chapter, we will develop a theory of ‘computable’ functions and 
sets of natural numbers that will play a fundamental role in the study of 
incompleteness of Peano arithmetic that will be undertaken in the next 
chapter. The basic building block of computability is provided by the 
notion of primitive recursive functions which we introduce in Unfor- 
tunately this notion is too restrictive since, as the example of the Ack- 
ermann function shows (.2)), some obviously computable functions do 
not belong to this class. A satisfactory definition of computable func- 
tions is provided by the notion of general recursive functions which is 
introduced in f.3, 


Another natural candidate for the class of computable functions are 
the functions which are computable by a Turing machine that we in- 
troduce in f.4. It is a fundamental result that the two notions coincide: 
a function is recursive if and only if it is Turing computable. One ad- 
vantage of Turing computability is that it easily yields the existence of 
universal recursive functions that play a major role in the theory. 


We conclude this chapter with the study of recursively enumerable 
sets (4.6). These are sets of natural numbers for which there exists an al- 
gorithm that enumerates their members. A set is recursive if and only it 
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is recursively enumerable and its complement is also recursively enu- 
merable. However not all recursively enumerable sets are recursive, 
which leads to several negative results like the undecidability of the Halt- 
ing Problem or Rice’s Theorem. 


4.1. Primitive Recursive Functions 


Notation. For n € N, weset %, := {f : N” > N}. Moreover, we set 
FS nek F,. The function f € F,, which to (xj,...,x,) associates 
f(@1,--» Xn) € N will sometimes be denoted by 


f =AX1 + Xq f (Ky 5s Xp): 


Definition. The set of primitive recursive functions is the smallest sub- 
set E of ¥ which satisfies the following properties: 


(RO) FE contains the following basic functions: 
¢ S =Ax.x + 1 (the successor function), 
+ the 0-ary (constant) function C) equal to 0, that is, C) = 
4.0, 
» for any n © N* and any 1 <i <n the projection PR” on the 
ith coordinate, that is, B” = Ax, --- Xp.X;. 


(R1) E is stable under composition: if f,,...,fn € FyN EF andh € 
F, OE, then g = h(f,,...,f,) € E.- 


(R2) E is stable under recursion: if g € F, Eandh € F,,,NE, 
then the function f defined by 
f(y, +5 X50) 2 = B(X,--, Xp), 
fs Xs YHI) 2 = hy, Xn VY. f (X15 > Xn Y)) 
is also in E. 
We observe that the constant functions C? = Ax, --- x,.k are primi- 


tive recursive for any k,n € N. (Note that the function C9 is obtained by 
recursion from g = C$ and h = P?.) 


If X CN", its characteristic function is denoted by 1x, that is, 
1 if (x,...,x%,) EX3 
0. else. 


Ty (X1,..,X,) = | 
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Lemma 4.1.1. The functions Axy.x+y, Axy.x-y, Axy.x¥, Ax.x!, Axy.x-y 
and 1 are primitive recursive. Here,x~y :=x—yifx >y,and xy := 
0, else. 


Proof. The addition f = Axy.x+y is obtained by recursion from g = P} 
andh =SoP3,asx+0=xandx+(y+1)=S(x+y). 

The other functions are easily obtained as well. Let us give the ar- 
gument for Axy.x—y and 1,.. One first constructs Ay.y—1 by recursion, 
via 0-1 = 0, (y+ 1)-1 = y. One may then define x0 = x, x-(y+ 1) = 
(x-y)-1. Finally, 1y.(x) = 1-(1—x). 


Definition. A subset X of N” is called primitive recursive if its charac- 
teristic function 1y is primitive recursive. 


Lemma 4.1.2. 


(1) The set of primitive recursive functions is stable under a permu- 
tation of variables. 

(2) IfX CN" is primitive recursive and if f,, .... fn © Fp are primitive 
recursive, then the set 


Y= {Xi 05 Xp) EN? | Gio) seg) Eh 
is primitive recursive. 


(3) The set of primitive recursive subsets of N" contains 8 and N", 
and it is stable under U, N and passage to the complement (so 
under boolean combinations). 


(4) The set {(x, y) | x < y} C N? is primitive recursive. 

(5) (Definition by cases.) Let N" = A,U...UA, be a partition of N” 
into finitely many primitive recursive sets Aj, and let f,,..., fic € 
F,, be primitive recursive. Then the function f, defined by f(x) = 
fi@) ifx © Aj, is primitive recursive. In particular, the functions 
AX, +++ Xp. Max(x;) and Ax, ---X,.min(x;) are primitive recur- 
sive. 


(6) (Bounded sums and products.) If f € F,,41 is primitive recursive, 
so are the following functions: 


y y 
Ax, ++ Xny. >) f(t) and Ax, + x»y. |] f@.0. 
t=0 t=0 
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(7) (Bounded -operator.) LetX C N"*1 be primitive recursive. Then 
f © Fnsy F(X, z) = (ut < z)((%, 1) EX), defined by 


0 ifthereisnot < zwith (x,t) EX; 
f(%,Z) = 4t9 ft is the smallest natural number 
t < zsuch that (x,t) EX 


is a primitive recursive function. 


(8) (Bounded quantification.) If X C N"*? is a primitive recursive 
set, so are the sets 


Xe = {(Xq,-05Xp,Z) EN"! | (Ct < z) (x,t) € X} and 
Xq = {X15 Xn, Zz) E NV+! | (Wt < z) (x,t) © X}. 


Proof. Part (1) is clear, and (2) follows from ly = 1x(f,,.... fy). 
(3) One has 1n\x = 1-1x and Ixny = 1x - Ty. 
(4) One has 1.(x, y) = Iny*(y—X). 
(5) One has f = 14, fi +... +a, fe 
(6) is proved by recursion. 
(7) Given X C N"t!, one has f(x, 0) = 0, 


f@%z) ifD_px@ozh 
f@%z+l=4z4+1 if, 1x(%t) = Oand &%z+1) EX; 
0 else. 


Thus f is primitive recursive (by recursion and definition by cases). 


(8) Passing to the complement, it is enough to treat the first case. 
One has tx, (%,z) = lif )7_, Ix(%,f) > 1, and ly, (%, z) = 0, else. 
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Example 4.1.3. 


(1) Let q : N? > N be the function which to (x, y) associates the 
integer part of x/y if y # 0, and 0 else. Then q is primitive 
recursive. [Indeed, q(x, y) = (ut < x)((t+1)-y>x).] 

(2) {(x,y) € N? : y | x}is a primitive recursive set. [Indeed, one 
hasy|x <= 4zz-y=x = x=q(%,y)-y.] 

(3) The set P of prime numbers is primitive recursive. [Indeed, 
xEP = x>2AVY <x) Y|x>~W=1Vy=xX)). By the 
closure properties stated in Lemma P is thus primitive 
recursive. | 

(4) The function z, which to x associates the (x + 1)th prime num- 
ber, is primitive recursive. [Indeed, one has z(0) = 2, z(x + 
1) = (uz < m(x)! 41) (z > (xX) AZ € P).] 

(5) A primitive recursive bijection between N? and N is given by 
the function a, = Axy.2(x +y+1)(*+ y) +x. Moreover, 
there are primitive recursive functions 67,83 € 4, such that 
a,(6?, 83) = idy. [Indeed, as a,(x,y) > min(x,y), one has 
B2(x) = (uz < x)(At < x)(a,(z, t) = x)), and similarly for the 
function 3.] 

By induction on p > 2, one defines a primitive recur- 
sive bijection a, : NP — N whose inverse has primitive re- 
cursive components bP open pe. For this, it suffices to set 
Op41(%1, seg Xp41) = Ap(%X1, seg Xp-1> A2(Xp, Xp41)): 

If (Xo, ..-,Xn—1) is a finite sequence of natural numbers, we define its 
Gédel number as 
(Xqy0)Xy_y) 1= 1(0)* +... (Nn — 2)*"2 - r(n — 1)*r-1t! — 1, 


Lemma 4.1.4. The map ( ) defines a bijection between the set of finite 
sequences of natural numbers and the set of natural numbers N. It satisfies 
the following properties: 

(1) The binary component function (x);, defined by 
x, ifi<n 
0 else, 


(X00) Xn-1))i = | 
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is primitve recursive. 
(2) The length function, given by lg({Xo,...,X,_1)) = n, is primitive 
recursive. 


(3) Foranyn EN, () tT yn : N” > Nis primitive recursive. 
(4) lg(x) < x for any x, and if x > 0, then (x); < x for any i. 


Proof. Part (4) is clear, and (3) follows from Example 4.1.3). 


(2) One has Ig(x) = (uy < x)[(Wz2< x)(y< 27> 2(z) + (x +1) if 
x > 0 and 1g(0) = 0, so lg is primitive recursive by Lemma 


(1) The description 
0 ifi > lg(x), 
(x)i = y(uy < x) (ai)? (x +1)) ifi+1=l1g(x), 
(uy <x)(x(iPt1 + (x+)) ifi+1 < lg(x) 


of the component function yields its primitive recursiveness. 


4.2. The Ackermann Function 


We now define the Ackermann function € € 4, a function which is com- 
putable in the intuitive sense but which turns out to be non-primitive 
recursive: 


» &(0,x) = 2%, 
* €(,0) :=1, 
© &yt+1x+) :=&9,6 +1,x)). 


This function is well-defined and may be computed in a finite number 
of steps (exercise; it suffices to consider the lexicographic order on N*). 


We will also write &,(x) for €(n, x), and E* for &, 0...0 &). 
Se 


k times 
Lemma 4.2.1. 
(1) &,(x) > x foralln,x EN. 
(2) &, is strictly increasing for alln € N. 


(3) Sn4iX) = n(x) for all n, x EN. 
(4) & is strictly increasing for all k,n € N. 
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(5) Ek(x) < E€+1(x) for all k,n, x EN. 
(6) E(x) < Ek(x) forall k,m,n,x € Nwithm <n. 
(7) EE(x) < Ena (x +k) forallk,n,x EN. 


Proof. We prove (1) and (2) simultaneously by induction on n. The case 
n = Oisclear. Suppose now that (1) and (2) hold for n. By the induction 
hypothesis, one has §,41;(% +1) = €,(&:41(%)) > &n41(x), showing that 
(2) holds for n + 1. As &,4)(0) = 1 by definition, (1) follows from (2). 


(3) En41(0) = €n(0), and En4i(% +1) = En(Ena1 0) = n(x + 1) by 
—_—— 
2x+1 

(1) and (2). 

(4), (5) and (6) are clear. 

(7) This is proved by induction on k, the case k = 0 being clear. Now 
suppose that the result holds for k. Using (3), one then has &k+1(x) = 
E(EK(x)) < En(Enar(X +) = Ena (x +k + D. 


Definition. We say that a function f € 4%, dominates the function 
g € F, if there exists N € N such that for all x € N” one has g(x) < 
f (max(xj, ..., X,,N)). 


Note that if f is strictly increasing, then f dominates g ifand only if 
g(x) < f(max(xy,..., x,,)) except for a finite number of n-tuples. 


We denote by C,, the set of functions in ¥ which are dominated by 
at least one of the functions €*, where k € N. 


Lemma 4.2.2. 


(1) Cy, © Cy foranyn <m. 

(2) €o contains the basic functions (S, C8 and all the projections B") 
as well as the functions Axy.x + y, Ax.k - x (for fixed k) and 
AX, ... Xp. Max(X],..., Xp). 

(3) C, is stable under composition. 

(4) Ifg € FNC, andh € Fy42 N Cp, then the function f defined 
by recursion from g and h is in Cy41. 


In particular, the sete = U 
functions. 


nen En contains all primitive recursive 
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Proof. (1) and (2) are clear. 


(3) Suppose that ff,..., fin © FMC, and g € Fy, A Cy are given. 
There exist natural numbers N,N,,...,Njy>k, ky, ky, Such that g(y) < 
EK(max(yj,...,¥m»N)) for every y € N™ and, for any i € {1,...,m} and 
x ENP, f(x) < En! (max(xX, Xp» Nj). Set M = max(N,MN,...,N,,) and 
1 = max(ky,...,k,,). Now let x € NP. Setting M; := max(X},...,Xp,M), 
one then has 


BAX), fin) S En (En (Mz)) = St! (Mg). 


(4) By definition of the recursion, one has f(x,0) = g(x) and 
f@ y+1) = h(x, y, f(, y)). By assumption, there exist natural numbers 
k,,N,,k2,Nz such that g(x) < En} (max(2xy, ..., Xp») and h(x, y,t) < 


2 (max(x,, Xp yet, N,)). Using induction on y, one easily obtains 
that 


- ky +yk 
(*) FR y) < Ex max(xy,... Xp, ¥s Ni Na))- 
Applying Lemma ..2.1(7), one deduces from (#4) that 
FY) S Sn4imax(xy,..., Xp, y,Ny,N2) + ky + kay), 
Ny 


a composition of functions in Cy 41 


whence the result by (3). 


Theorem 4.2.3. The Ackermann function is not primitive recursive. 


Proof. Suppose, for contradiction, that € is primitive recursive. Then 
Ax.&(x, 2x) is primitive recursive, too, and so, by Lemmaf..2.2, there exist 
k,n,N € N such that (x, 2x) < K(x) for all x > N. By Lemma f.2.1(7) 
it follows that §.(2x) < €,41(« +k) for all x > N. For x > max(k,n +1), 
this is a contradiction. 


Remark. An easy induction on n shows that all the functions &, are 
primitive recursive. 


4.3. Partial Recursive Functions 


The existence of the Ackermann function — computable in the intuitive 
sense but not primitive recursive — shows that we ought to enlarge the 
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class of functions we are considering. For technical reasons it is more- 
over convenient to pass to partial functions. 

By definition, a partial function from N" to N is a pair (A, f), with 
ACN" and f : AN. The set A is called the domain (of definition) 
of f and denoted by dom(f). We write 7%," for the set of such pairs, and 
Fr= Ue cn Jn- Most of the time, we will write f instead of (A, f). 


Definition. The set of partial recursive functions is the smallest subset 
E of ¥* which satisfies the following properties (RO)*-(R3)*: 


(RO)* E contains the basic functions (S, Ce and all the projections 
R?), 


(R1)* Eis stable under composition (of partial functions): Given par- 
tial functions f,,..,f. € Fn NE andh € JF, &£, then 
g = h(f.--»f,) © E, where g is the partial function which 
to x assigns h(f,(x),...,f,(x)) if x © dom(fj) for all i and 
(fi), .... f(x)) € dom(h). Otherwise, g is not defined for x. 


(R2)* E is stable under recursion (of partial functions): Given g € 
Fy NE andh € F,,, 1 E, then f € E, where 

« f(x,0) = g(x) ifx € dom(g), and otherwise f is not de- 
fined for (x, 0); 

f(x, y+1) =h@, y, f(x, y)) if f is defined for (x, y) (this 
property is defined simultaneously by recursion on y) and 
if &, y, f&, y)) € dom(h), and otherwise f is not defined 
for (x,y + 1). 


(R3)* E is stable under the u-operator: Given f € F,',, 1 E, then 
g © E, where g € 57, g(x) = uy(f(% y) = 0) is the partial 
function defined as follows: 

« if there is z such that f(x, z) = 0 and (x, z’) € dom(/f) for 
all z’ < z, then g(x) is the minimal such z; 
* otherwise, g is not defined for x. 


A partial function f € F,; is totalif dom(f) = N”. A (total) recursive 
function is a total function which is partial recursive. 


The following operation is reserved for total functions. 
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Let f € F,4, bea total function. Assume that f satisfies 


VX1,..,X,dyf(, y) = 0. 


Then we say that the (total) function g = Ax.uy (f(x, y) = 0) is obtained 
from f by the total -operator. 


We say that a set of total functions satisfies property (R3) if it is stable 
under the total x-operator. 


Exercise 4.3.1. Prove that the Ackermann function is recursive. (See 


also Proposition 4.5.4) 


4.4. Turing Computable Functions 


Turing machines provide a way to make precise the notion of a function 
which is ‘computable by a machine’. 


Definition. A Turing machine M is given by the following data: 


« A finite number (> 1) of tapes B,, By,... placed horizontally, 
each bounded on the left and unbounded on the right; each 
tape is divided into squares which are numbered by N* from 
left to right. The tapes are arranged in such a way that the 
squares with the same number lie on the same vertical line. 


« A read-write head which may read, erase and write symbols 
on the tape squares (one symbol per square). The head moves 
horizontally and is placed, at any moment, over one vertical 
column, and it manipulates all squares of this column simul- 
taneously. 

The set of symbols is given by S = {§, |, b} (b is called the 
blank symbol). 


Here are the data which are specific to WM: 
* n=n(M) € N* (the number of tapes). 
« A finite set Q of states. There are two (distinct) distinguished 
states in Q: qj; (the initial state) and q, (the final state). 
- A function M : S"xQ—-> S” x Q x {-1,0, 1}, the transition 
function of M. 


A description of the way the machine M operates: 
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« At every instant (point in time) t € N, M isin one of the states 
from Q. 


« M operates at every instant by changing states, erasing and 
writing symbols on the tapes and moving its head. 


The way M operates is subject to the following rules: 


(1) At the instant t = 0, M is in the initial state, and its head is 
placed over the squares number 1. 


(2) At every instant t, W reads the symbols (sj,...,5,) € S” writ- 
ten in front of its head, and the transition function M describes 
what M is supposed to do. If W isin state q and reads (s},..., Sy), 
and if M(s,q) = (s’, q’,€), then the head erases s, writes 8’, 
moves by ¢ horizontally, and M changes its state to q’, then 
passes to the instant ¢t + 1. 


(3) M stops to operate if it attains the final state. 
Correct input: 


« At the instant t = 0, $ is written in the tape squares number 1, 
and only there. 


* On each square a symbol from S is written, and only finitely 
many symbols are non-blank. 


(These conditions then remain satisfied during the whole computa- 
tion). 


Constraints: 
+ For anys € S”, one has M(s, e¢) = (5, e,, 0). 


* The head cannot erase the symbol $ (which marks the begin- 
ning of the tape) or overwrite a symbol different from $ with 
$: 

- For any q € Q, M((§.,...,$), q) = ((S,..., $), q’, €) for some 
e € {0,1}; = 

- ifs # ($,...,$), then M(, q) = (s’,q',¢) with s; # $ for all 
i. 


Remark. A Turing machine is deterministic, that is, every step in the 
computation follows in a unique way from the preceding one, and one 
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may thus predict for every instant t the position of the head, the state of 
the machine and the content of the tape squares. 


Definition. 


(1) A tape represents (at a given instant) the natural number m if 
what is written on it equals (, |,...,|,D,...,D,...). 
m times 
(2) A Turing machine M computes f € F,; ifn(M) = p +1 and if 
for all m € N?, when M starts operating with the input where 
fori = 1,..., p, the tape B; represents m; and, fori > p, the tape 
B; represents 0, then 

- ifm € dom(f), then M stops after some finite time, 
and its tapes successively represent the natural numbers 
(M,.., Mp; f(m),0,...,0) (in this order); 

- ifm ¢ dom(f), M either never stops, that is, qr is 
never attained, or WM stops after some finite time, but 
when it stops there is no n € N such that the tapes of 
the machine M successively represent the natural num- 
bers (mj,..., Mp, N, 0,..., 0). 


(3) The partial function f is Turing computable if there is a Turing 
machine M which computes f. 


Remarks. 


(1) There are many variants of the model of a Turing machine (one 
may for example work with tapes which are unbounded on the 
left and on the right), all leading to the same notion of Turing 
computability. 

(2) Using unary code to represent natural numbers is of course 
very inefficient, but it suffices for our purposes, since we are 
not concerned with practical feasibility or complexity issues in 
this book. 


Lemma 4.4.1. The basic functions (S, C§ and the projections PB") are Tur- 
ing computable. 


Proof. CQ is computed by a Turing machine M with one tape, a set of 
states Q = {q;,q,}, and with a transition function M($, q;) = ($, qf, 0). 
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(Here, and in what follows, we only give the relevant part of the transi- 
tion function M.) 

R" is computed by M with n + 1 tapes, a set of states Q = {q;,q/}, 
and with a transition function M(§, ...,$,q;) = ($,...,$,q;, +1) and 


(S},--5Sy,|,€;,+1) ifs; =|, 


M(3s,..., Sn, 5, qj) = 
(s1 nv bai) eeaeteae ifs; =b. 


The successor function S is computed by a Turing machine M with 
two tapes, a set of states Q = {qj, af}, and with a transition function 
M(S, $, qi) a ($, $, qi> +1), M(, b, qi) = (I, |, qi> +1) and finally M(b, b, qi) 
= (b, |, qf>0). 


Lemma 4.4.2. The set of partial Turing computable functions is stable 
under composition. 


Proof. Let f,...,f, € Fp and g € F,' be Turing computable, and let 
h = 8(fir--> In): 

By assumption there exist M;,1 < i < n, where M; is a Turing 
machine with p; > p + 1 tapes and a set of states Q; which computes fi, 
as well as a Turing machine M’ with n’ > n+ 1 tapes and a set of states 
Q’ which computes g. 

Let M be a Turing machine with p+(n’—n)+ Yi — p) tapes and 
a set of states Q = {qq, qc}UQ'UU, Qi. The initial state of I corresponds 
to that of 7, and its final state to that of M’. 

The machine M operates as follows (the precise description of the 
transitifion function of M is left as an exercise): 

If (m,...5 Mp, 0,...; 0) are represented on the tapes, M starts to com- 
pute f,(™m),..., Mp) as M, would do, using the states in Q, \ {ay} and 
(Pp, — p) auxiliary tapes (all different from B,,,). In state q¢> it returns 
to the beginning of the tapes, then passes to state q? € Q, and computes 
i), using the states in Q, \ {gq} and (pz — p) auxiliary tapes (all dif- 
ferent from B,,, and from the previous auxiliary tapes). In this way, M 
successively computes f,(m),..., f,(m). Once f,(m) has been computed, 
M passes to the initial state of Wl’ and computes h(f,(™),..., f,(m)) as 
M’' would do, using Q’ and n'—n auxiliary tapes (distinct from B,,;). For 
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this computation, it uses the tapes on which are represented the f;(m) 
as input tapes and B,,; as an output tape. Renumbering the tapes, the 
transition function of may thus be established using the transition 
functions of M,,...,M,, and M’. 

Once h(f,(™m), ..., f,(m)) is computed, instead of passing to the final 
state of 1’, M passes to state qq which serves to move its head to the 
beginning of the tapes. Then, in state qe, it ‘clears’ the tapes on which 
are represented the f;(m), and finally it passes to the final state. 


Lemma 4.4.3. The set of partial Turing computable functions is stable 
under the -operator. 


Proof. The easy proof is left as an exercise. 


Lemma 4.4.4. The set of partial Turing computable functions is stable 
under recursion. 


Proof. Suppose that g € F," is computed by the Turing machine M with 
p+1+k tapes and a set of states Q, and that h € F,,, is computed by 
M' with p+3+k’ tapes and a set of states Q’. Let f € 5,4, be the partial 
function defined by recursion from g and h, that is, f(x,0) = g(x) and 
fy +1) =hwy, fy). 

We now give an informal description of a Turing machine N which 
computes f: N has p+4+k+k’ tapes, and its set of states is given by 
QUQ’ plus a couple of auxiliary states, with qj’ = q}* and q? = qr. 

On input (mj, ..., p41), N operates as follows: 


(1) Compute g(m,,...,m™,) with input tapes B,,...,B,, an output 
tape B,,, and auxiliary tapes B,,5,...,,Bp444x, operating as M 
would do, up to renumbering the tapes. 

(2) Compare m,, and the natural number y which is represented 
on B,,3 (note that during the whole computation, B,,3 will 
represent a natural number < mp41): 

+ if m4; = y, then go to step (5); 
+ if mp4; > y, then go to step (3). 

(3) Compute h(m),..., Mp, y, f(x,y) = f(xy + 1) as M’ would 
do, with input tapes B,,...,B,, By,3,Bp,2, an output tape B, +4 
and the last k’ tapes as auxiliary tapes. 
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(4) Copy the content of B,,4 onto tape B,,», then clear tape B, 44 
and increment the content of B,,3 by one, that is, pass from y 
to y +1. Then go back to step (2). 


(5) Clear tape B,,3 and stop. 


Theorem 4.4.5. Every partial recursive function is Turing computable. 


Proof. It is enough to combine the four preceding lemmas. 


In order to establish the converse of Theorem.4.5, we ought to code 
Turing machines and the way they operate. 


Coding of Turing machines. We identify the set S = {b,$,|} with 
{0, 1, 2}, via 0 + b,1 + $ and 2 «|. We code a sequence (s;);<, of sym- 
bols in S, or a sequence (s;);ey With s; = 0 for almost all i, by I'((s;)) = 
ise 5,3! EN. 
Let M be a Turing machine. Then M is given by: 
* n=n(M) > 1, the number of tapes of M. 
¢ The set of states Q. We will suppose that Q = {0,..., m} (thus, 
m > 1 equals card(Q) — 1), with qj = O and q;7 = 1. 

¢ The transition function M : S"°x Q > 8S" x Qx {-1,0, 1}. 
To code the transition function M, if p = (sj,...,5,,q) € S” x Q and 
M(p) = (ty, .--, tn, gq’, &), we Set 


n(P) = a2(T(s}, aoe Sn), q) 
and 
r(p) a a3(T(ty, oa] tn), qs E+ 1), 


and then 


rMt= TT xn(oye. 


pEes"?xQ 
To decode, we will use the following function 6 € 4, defined by d(i, x) : 
(uz < x) (z(i)?*" + x). We then have 


5(az(T(s,,.-,5,),Q),° M7) = a3(T (ty, ..., ty), Q',€ +1). 


Finally, we define the index of M as "M7 = a3(n,m,"M"). 
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Lemma 4.4.6. For any p > 0, the set 
Ip = { M" | M is a Turing machine with at least p + 1 tapes} 
is primitive recursive. 
Proof. It is easy to see that one may recognize in a primitive recursive 


way if the constraints on the transition function are met. The details are 
left as an exercise. 


A configuration C = C(t) of M (at an instant t) is a sequence (s;); € 
S® such that s,4, equals the symbol which is written on the (v + 1)th 
square of tape B,,,,. (We suppose that there is only a finite number of 
non-blank symbols and that $ is written at the beginning of the tapes 
and only there.) 


We code the configuration C by '(C) := I'((s;)). To decode, we will 
use the function 


n(T(C), U, V, n) =r (q(I(C), gntu-1)+(-D), 3) 


(here, q denotes the result of the division with remainder, and r denotes 
the remainder), which gives the symbol s which is written on the uth 
square of tape B,. If o = (s},...,5S,) is the sequence of symbols written 
on the squares of number u, then (a) = e(I'(C), u, n), where E(x, y, Z) = 
1 (q(x, 379-Y), 37). 

The situation Sit = Sit(t) of M (at an instant t) is given by (q, k, C(t)), 
where q denotes the state of M at the instant t, k the number of squares in 
front of which is placed the head at the instant t and C(t) the configura- 
tion at the instant t. We code the situation via I(Sit(t)) = a3(q, k, [(C())). 


Lemma 4.4.7. Let p > 0. There exists a primitive recursive function g? € 
F, such that the following properties hold: 


¢ gP(i,x) =O0ifi EI); 
¢ ifi ="M" and x is the code of the situation of M at the instant t, 
then g? (i, x) is the code of the situation of M at the instant t + 1. 


Proof. We suppose that i = "MM" € I,. Then 


+ the current state of M is B}(x) = q, 
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+ the number of squares which are currently observed is 63(x) = 
k, 

« the code of the current configuration C(t) of the machine M is 
B3(x) = T(C(), 

- the number of tapes of M is given by 63(i) = n, 

+ the number of states of I minus 1 is given by 63(i) = m, 

+ the code of the transition function is given by 63(i) = "M7, 


« the code of the sequence of symbols read by the head at the 
instant t is given by 


e(I(C(t)), k,n) = €(83(x), B3(x), BI@) =! ¢. 

If x is not the code of a situation, we set g?(i, x) = 0. This is the case 
if B3(x) > m or B3(x) = 0 or if 63(x) is not the code of a configuration 
with $ at the beginning of the tapes and only there (the last condition 
may be expressed in a primitive recursive way, using the functions ¢ and 
0). 

Otherwise, let 6 := 5 (@2(c,q),"M7). Then c’ = 63(6) is the code 
of the sequence of symbols written in the place where was previously 
written the sequence coded by c. We may set g?(i, x) = a3 (q’,k’, (C’)), 
where 

T(C’) = (T(C) + 32D . c) = (3"K=D . c) 
is the code of the configuration of the machine ™ at the instant t + 1, 
q = B3(6) is the state of WM, and where k’ = (63}(x) + 63(4)) +1 is the 
number of squares which are observed by the head of M at the instant 
t+1. 


One defines a function ST? € Fy 42 as follows: 
¢ ST? (i, t,x) = 0, ifi ¢ 1; 


- otherwise, ST? (i, t,x) is the code [(Sit(t)) of the situation at 
the instant t of the machine M of index i which has started to 
operate at the instant t = 0 with the following configuration: 
on B,..., By are represented the natural numbers x),..., Xp, and 
the other tapes represent 0. 


Theorem 4.4.8. The function ST° is primitive recursive for any p > 0. 
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Proof. The function 2ix.ST” (i, 0, x) is primitive recursive. (This is easy 
to see and is left as an exercise.) 

Moreover, one has ST?(i,t + 1,x) = g? (i, ST?(i, t, X)), so the result 
follows by Lemma 


An output configuration for x = (xj,...,Xp) € N? is a configuration 
where the tapes B,,...,B, represent Xj,..., Xp, the tape B,,, represents a 
natural number and where the other tapes represent 0. 


For p > 0, we define a set EP C NP+2 as follows: 


i€l, and c is the code of an 


(i,c,x) € EP: <=> ; = 
output configuration for x. 


It is easy to see that E? is a primitive recursive set. (In order to ex- 
press for example that B,,, represents a natural number, observe that 
one may bound the universal quantifier in the formula Vz(n(c, z, p+1, n) 
=2Az>3 > n(c,z-1,p +1,n) = 2) byc. The details are left as an 
exercise. ) 


We now define sets BP C NP*+? and CP C NPt3 as follows: 
Bi (ST°Gt,x)) = 1 and 
(i, B3(ST"(i, t,%)), X) € EP. 


(This expresses that at the instant tf, the machine of index i is in the final 
state, and its configuration is an output configuration for x.) 


(i,t,x) € BP: <=> | 


(i, t,x) € BP and 


(i,y,t,x) ECP: <=> : 
y is represented on B,,1. 


The sets BP and C? are primitive recursive. 


Now let f € J," be a partial Turing computable function. Choose 
a Turing machine M which computes f, and set i = 'M™". One may 
define the partial function Ty; (which gives the computing time) as 


Ty (X) := mt (i,t, x) © BP). 
One then gets 
(*) £@) = (uy S Tye))(Gy, Tye), X) € CP). 


The description (4) is called the Kleene normal form of f. 
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In particular, we have proved the following two results. 


Theorem 4.4.9. Every partial Turing computable function is recursive. 


Proposition 4.4.10. 


(1) If f is a total Turing computable function which may be com- 
puted in a primitive recursive time, then f is primitive recursive. 


(2) The set of partial recursive functions is the smallest subset of ¥* 
which contains the primitive recursive functions and which is sta- 
ble under composition and under application of the u-operator. 


(3) The set of total recursive functions is the smallest subset of F 
which contains the primitive recursive functions and which is 
stable under composition and under application of the total p- 
operator. In other words, every total recursive function may be 
obtained by a finite number of applications of the rules (RO)- 
(R3). 


Let us observe that our arguments to prove that every Turing com- 
putable function is recursive are not specific to Turing machines. This 
justifies the following thesis. 


Church’s Thesis. Every function which is computable (in the intuitive 
sense) is recursive. 


4.5. Universal Functions 


It suffices to vary the index of the Turing machine in order to obtain a 


universal function. Let us first define a function T? € F,},, as follows: 


- ifi¢ Ip, then TP(i, x) is not defined; 
+ ifi € Ip, then T?(i, x) = ut [(i, t,x) € BP]. 


We may now define a partial recursive function g? € F;",,, via 


pP(i,x) = uy[(i,y, T?(i, x), x) € CP]. 
We set og) = Ax.g? (i, X). 
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Proposition 4.5.1. gy? is a universal partial recursive function, that is, 
every partial recursive function in p variables is of the form ge for asuitable 
natural number i. 


Theorem 4.5.2 (s-m-n Theorem). For any pair of natural numbers m 
and n there is a primitive recursive function si’ © F,,, such that for all 
iE N,x € N" andy € N”™ one has 


or, y= Pym ci,z)) 


Proof. Let M be a Turing machine with at least m+ n+ 1 tapes. Given 
(a,,...,d,) © N”, consider the Turing machine M’ which operates as 
follows: 


(1) It writes aj, ...,a,, on the tapes By.42,--, Brin: 


(2) It then operates as Wf, up to permutation of the tapes, and it 
writes the result on the tape B,,,1. 


(3) It then clears the tapes B,,4>,.--, By4n41, and finally it stops. 

It is clear that there is a primitive recursive function g € F,41 
which computes the code "M"" € I, from "M" and aj,..., ay, that is, 
gM, ay,...,d,) = "M' and g(i,a) = Oifi ¢ I,4,. We may thus set 
Sieg. 


Theorem 4.5.3 (Kleene’s Fixed Point Theorem). Suppose m € N* and 
a € fF, isa total recursive function. Then there exists i € N such that 


Pi" = Pati: 
Proof. Consider g = Ayx, ...Xm.9'"(a(si"(y, y)), X15 +» Xm). By univer- 


sality (Proposition .5.1), one has g = 97+! for some index a € N. One 
gets 


Pe(sPt(yny OD = Pa” *O>¥) = P3in(a,y) 


Setting i := s{"(a, a), one obtains Pei) = gj". 


The Fixed Point Theorem provides an elegant argument to establish 
the following result (cf. Exercise f.3-1). 


Proposition 4.5.4. The Ackermann function & is recursive. 


Proof. Define a partial recursive function 6 € ¥,* as follows: 
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° Oi, y,x) = 2* ify =0; 
° Oi, y,x) =1ifx =0; 
> Oi, y,x) = v*(i, y-1, o(i, y, x~1)) otherwise. 


By universality, there exists a € N such that 6 = g3. By the s-m-n 
Theorem, this yields 


Hi YX) = Poa¢q,)%- 
Set a = Ai.sj(a,i). By Kleene’s Fixed Point Theorem (Theorem 


there exists ip € N such that gf = 9%). The function gj, is recur- 
sive and satisfies the relations which define the Ackermann function. 
Indeed, for y, x € N*, one has 
$5, (Y. x) = Op, Y, x) 
= 9 (in, y — 1, 97(in, ¥,X — 1) = 93, (Y — 1, 9%, (Y, x — 1). 
Finally, note that the totality of the function Yi, follows by induction as 
well. 


4.6. Recursively Enumerable Sets 


Definition. A set of natural numbers X C N” is called recursively enu- 
merable if there is a recursive set Y C N”*+! such that X = z(Y), where 
m denotes the projection on the first m coordinates. 


Remark. Every recursive set is recursively enumerable. 


Proposition 4.6.1. The set of recursively enumerable sets is stable un- 
der intersection, union, projection and bounded universal quantification. 
Moreover, if f,,..-, fn © Fm are recursive and X C N” is recursively enu- 
merable, then {x € N™ | (f,(X),..-,fm(x)) € X} is recursively enumer- 
able. 


Proof. We say that an n-ary relation R(x,,...,X,) on N is recursively 
enumerable if there exists an n + 1-ary recursive relation R(x, y) on N 
such that R(x) <=» JyR(x, y) holds in N. In what follows, all equiva- 
lences are meant to be in N. 


Now assume that R(x) and S(x) are n-ary recursively enumerable re- 
lations. Choose recursive relations R(x, y) and S(x, y) such that R(x) <> 
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AyR(x, y) and S(x) <> AyS(x,y). We then have the following equiva- 
lences: 


R@)V SCH) <> ay(R@y) VSR). 
R(X) ASG) <=> Ay (RE, APY) A SC, B30) - 
AzR(y,z) —> AzduR(y,z,u) —> IsR(, B7(s), 63(s)). 


Moreover, (Vz < w) R(X, Z) is equivalent to (Vz < w) dy R(x, z, y), which 
in turn is equivalent to ds(Vz < w) R(X, Z, (s)z), where (s), denotes the 
component function from Lemma 1.4) 

Finally, suppose that f,,..., f. © Fm are recursive functions. Then 


R(fi(X), ---» fr(X)) is equivalent to SyR(f,(X),..., fa(X), y), which proves 
the moreover part. 


Theorem 4.6.2. For X CN", the following are equivalent: 


(1) X is recursively enumerable. 


(2) X is empty or it is the image of a function f = (fi, .... fn) 1 N-> 
N”, where f,..., fn € F are (total) recursive. 


(3) X = dom(g) for a partial recursive function g € F,;". 


(4) There exists a primitive recursive set Y C N"*! such that X = 
n(Y). 


Proof. We may assume n = 1. Indeed, we may identify N” and N using 
the bijection a, as a, as well as the component functions £/" of aj" are 
primitive recursive. 

(1)=(2): Let r € X and let Y C N? be recursive such that X = B?(Y). 
Then X = im(f), where f € J is defined as follows: 


ae if (B7(2), B3(2)) € Y, 
r else. 


(2)>(3): If f € F, is a recursive function, we define g = Ax.ut(x = 
f(@). Then g is a partial recursive function and dom(g) = im(f). More- 
over, § = dom(g) for g = Ax.ut(x > x). 
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(3)=>(4): Let g € F;* be a partial recursive function such that X = 
dom(g). By universality there is ip € N with g = Pin» that is, 


X ={x EN | At[(ip, t,x) € B']}. 
This proves (4), as B! is primitive recursive. 
(4)>(1): Trivial. 


Remark. Given that (1) and (4) are equivalent, the proof of the impli- 
cation (1)=>(2) actually shows that one may suppose in (2) that the func- 
tions fi,..., f, are primitive recursive. 


Corollary 4.6.3. The set U = dom(g"”) C N"*? is universal recursively 
enumerable in the following sense: U is recursively enumerable and every 
recursively enumerable set X C N" is of the form U, = {x | (e,x) € U}for 
somee EN. 


Theorem 4.6.4 (Theorem of the Complement). A setX C N” is recursive 
if and only if both X and N” \ X are recursively enumerable. 


Proof. “=>” is clear. To prove “=”, assume that X C N” is such that 
there are recursive subsets Y and Y’ of N’*1 with X = z(Y) andN"\X = 
nm(Y"). Then 1x(Z) = 1y(z, ut[(z,t) Ee YUY’)). 


Theorem 4.6.5. The set dom(¢g') is not recursive. In particular, there 
exists a recursively enumerable non-recursive set. 


Proof. We will prove that the domain of the function g = Ax.g1(x, x) is 
not recursive. (In other words, one may not decide if a Turing machine 
stops when operating on its own code.) 

Let D = N \ dom(g). If D were recursively enumerable, then D = 
dom(¢;, ) for some ig € N by Corollary In particular, we would get 
ig € D if and only if ¢'(io, ip) is defined. But this is absurd, since the 
definition of g implies that i) € D if and only if ¢'(io, ig) is not defined. 


The Halting Problem HALT is defined as the set 
{i € I, | M of index i halts when operating on the empty input}. 


A slight variation of our argument in the previous proof yields the fol- 
lowing. 
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Theorem 4.6.6 (Undecidability of the Halting Problem). The set HALT 
is not recursive. 


Proof. Suppose for contradiction that HALT is recursive. It is easy to 
see that I, = dom (Ax.g1(x, 0)) is then recursive as well, where I, is by 
definition the set of indices of Turing machines which halt in an out- 
put configuration for the input 0 when operating on the empty input. 
Indeed, if i € HALT, then i € I, if and only if 


(i, 63(ST' (i, ut[Bi(ST'G, t,0)) = 1],0)),0) € EY. 


Now choose a recursively enumerable non-recursive set A C N 
(which exists by Theorem[..6.5). Let M be a Turing machine which com- 
putes a function f € F,* with dom(f) = Ax{0}. Let ig be the index of M. 
Then A = {n EN | s}(ig,n) € Ix}, and so A is recursive, contradicting 
our assumption. 


Theorem 4.6.7 (Rice’s Theorem). Let X be a set of partial recursive func- 
tions in onevariable. Suppose that X is neither empty nor equal to the set of 
all partial recursive functions in one variable. ThenI = {i EN | g} € X} 
is not recursive. 


Proof. Passing to the complement if necessary, we may assume that the 
function with empty domain is in X. Choose j € N \ I, and consider 
the function ~(x, y,z) = (¢'(j,z) + g(x, y)) ~9'(x, y). Letting p,, = 
Az.p(x, y, Z), we get 


Pry EX <—> (x,y) € dom(¢g') = U. 


at . a 34 
We have ~ = ¢j, for a natural number ip, so ~, = PG aay by 
the s-m-n Theorem. Consider h = Axyehlig; x, y), which is a recursive 


function. Then 
(xy)EN*\U <= h(x, y) EL. 


As N? \ U is not recursive by Theorem (and the Theorem of the 
Complement), J is not recursive either. 


Example 4.6.8. 


(1) Let f © F* be some partial recursive function. Then the set 
{LEN | p} = fh is not recursive. 
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(2) The set {(i, j) € N? | ge? = 9} CN? is not recursive. 


(3) The set of indices of total (recursive) functions is not recursive. 


4.7. Elimination of Recursion 


The following elementary result is crucial for the next chapter, since it 
shows that the recursive functions are definable in the language of arith- 
metic Lo,. 


Theorem 4.7.1 (Elimination of Recursion). The set of total recursive func- 
tions is the smallest subset of ¥ which contains the basic functions C9, S, 

RB", +, -, l= and 12, and which is stable under composition and the total 

[-operator. 


Let E be the smallest subset of ¥ which contains the basic functions 
c ,S, RB”, +,-, 1_ and 1., and which is stable under composition and the 
total -operator. We will temporarily (until Theorem [.7.l|is proven) say 
that a function is #-recursive if it is in E, and that a set X is #-recursive 
ifly € E. We will first prove a lemma. 


Lemma 4.7.2 (Gédel’s 6-function). 


(1) The function Axy.x-—y is #-recursive. 

(2) The collection of #-recursive sets is stable under boolean combi- 
nations and bounded quaniification. 

(3) The set {(x, y,z) € N° | x =y mod 2} is #-recursive. 

(4) The collection of #-recursive functions is stable under definition 


by cases (see Lemma f.1.2(5)), if both the functions and the sets 
of the partition are #-recursive. 


(5) There exists a #-recursive function B € F; (called Gédel’s f- 
function) such that for any finite sequence of natural numbers 
(Co, «-»»Cn—1) there area, b € Nwith B(a, b,i) = c; fori = 0,...,n—- 
1. 


Proof. (1) One has x~y = wz(x < (y+ Zz) +1). 


(2) One has lye = 1-ly and ly,y = 1x - ly, which establishes 
stability under boolean combinations. Now suppose that X C N"*1 isa 
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#-recursive set. Define a function f € F,44, 


f(x,z) := uy (&%, y) EX ory=z+1). 


As the condition in parentheses may be expressed by a condition of the 
form g(x, y,z) = 0 for some #-recursive function g, it follows that f is 
#-recursive. Stability under bounded existential quantification follows 
from the equivalence (Ay < z)(X,y) EX — f(x,z) < zZ+1. For 
Vy < z, the same follows by passing to the complement. This finishes 
the proof of (2). 


(3) One has the following equivalence: 
x=y modz = (Jw<xty)(x=yt+w-zVy=x+w-z). 


(4) Let N” = A,U...UA, be a partition with A; #-recursive for all 
i, and let fi,..., fc © F, be #-recursive functions. Then oo Ta, ° fi is 
#-recursive. 

(5) Set B(a, b,i) := uz[z =a mod ((i+ 1)b+ 1)]. By what we have 
seen, 6 is #-recursive. Now let Cp, ..., C,_; be given, and let b € N be such 
that bis divisible by n! and b > ¢; for alli. Then b+1, 2b+1, 3b+1.,...,nb+ 
1 is a sequence of coprime integers. Indeed, if a prime p divides ib + 1 
and jb + 1 for some 1 <i < j <n, then p + band p | b(i— j). Thus 
p|@-—j). Since (i — j) | n!, this is absurd. 

By the Chinese Remainder Theorem there exists a € N such that 
for alli = 0,...,2 —1 one hasa = c; mod (i +1)b+1). Asc; < 
(i+ 1)b + 1 for all i, c; is the smallest natural number z such that z = a 
mod ((i+1)b+1). 


Proof of Theorem By Proposition .4.10(3) it is enough to prove 
that the collection E of #-recursive functions is stable under recursion. 
Let g € F, andh € F,,,. be #-recursive functions, and let f € F,4, be 
the function obtained by recursion from g and h. Consider the set 
Z :={(x, y, a,b) E N"*3 | B(a,b,0) = g(x) 
and (Vi < y) (a, b,i+ 1) = h(x, i, B(a, b, i))}, 
which is #-recursive by Lemma 4.7.2. Moreover, the key property of 6 
shows that for any (x, y) € N"*1 there are a, b € Nsuch that (x, y, a, b) € 
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Z. Thus, e = Axy.us[(da < s)(Ab < s)(X,y, a,b) € Z| is a #-recursive 
function. Finally, f(x, y) equals 


uz|(da < e(X, y))(Sb < e(x, y)) ((X, y, a, b) € Z and z = B(a,b, y))], 


proving that f is #-recursive. 


4.8. Exercises 


Exercise 4.8.1. Describe a Turing machine which computes the addi- 
tion function Axy.x + y. 


Exercise 4.8.2. Let p and q be prime numbers. Then q is called p- 


n_ 
Mersenne if q = P=" for some n € N. Prove that the following set is 


primitive recursive: 
{N € N| there is a prime p such that N is a p-Mersenne prime}. 


Exercise 4.8.3. The Fibonacci function fib € F is defined via fib(0) := 
0, fib(1) := 1, and fib(n + 2) := fib(n + 1) + fib(n) for all n € N. Prove 
that fib is primitive recursive. 


Exercise 4.8.4 (Kalmar elementary functions). The set of elementary 
(recursive) functions is the smallest subset E of ¥ which satisfies the fol- 
lowing properties: 


« Econtains C , the projections Rp" for all1 <i <n, the addition, 
the multiplication and 1_; 


- ifgeA,nEand f,...,f, €F#, NE, then g(f.,..., fe) € E: 
° if f © F,4, NE, then the bounded sum 


x 
(X1,.05Xy,X) > Sf (1, +. Xp 8) 
i=0 


and the bounded product 
x 
(X1,.25Xy,X) [] fen. xn i) 
i=0 
are in E, too. 


(1) Prove that for all n, k, the constant function CZ is elementary. 
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(2) Callasubset A C N” elementary if1, is elementary. Prove that 
{0} C Nis elementary, and that the set of elementary subsets of 
N” is stable under Boolean combinations. 


(3) Prove that the exponential function exp = Axy.x” is elemen- 
tary. 

(4) We now define a function T € &, setting T(m,0) = m and 
T(m,n + 1) = exp(2,T(m,n)). For a natural number n, let 
T, = Ax.T(x, n). 

(a) Prove that T is primitive recursive. 

(b) Prove that T,, is strictly increasing for all n and that, for 
fixed m, the function (m,n) + T(m,n) is strictly increas- 
ing inn. 

(c) Prove that for every elementary function f, there exists 
n € N such that f is dominated by T,,. 

(d) Prove that T is not elementary. 


Exercise 4.8.5. 


(1) Let f © F, be an increasing recursive function. Prove that 
im(f) is recursive. 


(2) Conversely, prove that any infinite recursive subset of N is the 
image of a strictly increasing unary recursive function. 


(3) Prove that every infinite recursively enumerable set contains 
an infinite recursive set. 


Exercise 4.8.6 (A primitive recursive bijection whose inverse is not prim- 
itive recursive). 


(1) Prove that the set of recursive bijections between N and N forms 
a group. 


(2) Prove that for every Turing machine MM which computes a to- 
tal function, the graph of the computing time function Th; is 
primitive recursive. 


(3) Prove that f € F,, is primitive recursive ifand only ifits graph is 
primitive recursive and f is bounded from above by a primitive 
recursive function. 
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(4) Let g € & be strictly increasing. Prove that the graph of g is 
primitive recursive if and only if im(g) is primitive recursive. 


(5) Let f € 4 bea recursive function which is not primitive re- 
cursive, and let M be a Turing machine computing f. 
(a) Let go € F be defined by 


Zo(X) = sup{Ty(y) | y < x} + 2x. 


Prove that gp is recursive, but not primitive recursive, and 
that the graph and the image of gy are primitive recursive 
sets. 

(b) Let g, € F be a strictly increasing function such that 
im(g,) = N \ im(g,). Consider the function h € F given 
by h(2x) = go(x), h(2x +1) = g(x). Prove that h is bijec- 
tive, recursive, but not primitive recursive, and that h-lis 
primitive recursive. 


Exercise 4.8.7 (Existence of recursively inseparable recursively enumer- 
able sets). 


(1) For a natural number k € N, denote by Z, the set of alln € 
N such that n € dom(g}) and gi(n) = k. Prove that Z, is 
recursively enumerable. 


(2) Deduce from this that there exist recursively enumerable sets 
A,B C N with An B = @ such that there is no recursive set 
CCNwithA CCandCnB=9%. 


(3) Prove that there is a partial recursive unary function which 
cannot be extended to a total recursive function. 
Exercise 4.8.8. Prove that there are primitive recursive functions s), 5, € 
F, such that whenever ¢? is bijective, the components of its inverse are 
: 1 1 
given by ¢, (j and 9%, (j). 
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Chapter 5 


Models of Arithmetic and 
Limitation Theorems 


Introduction 


Our aim in this chapter is to study models of Peano arithmetic. We start 
by describing a way to encode formulas and proofs in a finite signature 
in a recursive way. This applies in particular to weak Peano arithmetic 
PAy and full Peano arithmetic PA which we introduce in A key 
result is the Representability Theorem 6.3.5 which asserts that total re- 
cursive functions are represented by 2,-formulas. Using the diagonal 
argument (Proposition [5.4.1}), we prove the theorem of Tarski on the non- 
definability of truth (Theorem and the theorem of Church on the 
undecidability of arithmetic (Theorem 5.4.5). 


In the Section 5.5] we prove Gédel’s First Incompleteness Theorem. 
The last two sections are devoted to a complete proof of Gédel’s cele- 
brated second incompleteness theorem. This requires full Peano arith- 
metic PA contrary to the previous results for which only weak Peano 
arithmetic PA, was needed. A key ingredient is the definability of satis- 
fiability for X,-formulas which is proved in Proposition 5.6.1} 


119 


Licensed to Mathematisches Forschungsinstitut. Prepared on Thu Sep 30 04:18:29 EDT 2021for download from IP 188.1.230.74. 
License or copyright restrictions may apply to redistribution; see https://www.ams.org/publications/ebooks/terms 


120 5. Models of Arithmetic and Limitation Theorems 


5.1. Coding Formulas and Proofs 


Let o© = {Aj,,...,A;} be a finite signature. We assign to each symbol 
s of £ its Gédel number "s” as follows: to =,A,7,(,),4 one assigns 
(0, 0), (0, 1),..., (0, 5), to A; one assigns (0,i + 5), and to v; G@ € N) one 
assigns (1, i). 
Toa word m = 5S, :-- Ss, on the alphabet £ we assign the Gédel num- 
ber 
#m = (s,),...,°8, 7). 

This coding is clearly injective. 
Lemma 5.1.1. The following sets are primitive recursive: 

(1) Term = {#t | t isan L-term}. 

(2) Form = {#9 | 9 is an £-formula}. 


(3) {@t,n) | t © F* and v, hasan occurrence in t} as well as 
{(#t, n) | t € T* and v,, has no occurrence in t}. 


(4) {#¢.n) |e € Fml* and Vy has an occurrence in gp}, and simi- 
larly for ‘has no occurrence? ‘has at least one free occurrence} 
‘has no free occurrence} ‘has at least one bound occurrence’ 
and ‘has no bound occurrence’ instead of ‘has an occurrence? 


(5) {#9 | gy is an L-sentence}. 


Proof. We use properties of the coding function (...) for finite sequences 
(cf. Lemma and the primitive recursive decoding function in 
two arguments Axi.(x);. In particular, we use the inequality n = 
Ig((so, sa88 Sn-1)) < (So, seep9 Sani) 

By unique reading properties, we may not only argue by induction 
on length, but also on height of terms and formulas. For instance, if x is 
the code of a word m starting with ‘(’, then x is the code ofa formula ifand 
only if there exist y,, yz < x such that y, = #9; for some formulas ¢), ~2 
such that m is the word (9g A 2). The details are left as an exercise. 


Recall that we have defined substitution in terms and formulas. If 
Xj,...,X, are n pairwise distinct variables, and sj, ...,s,, are terms, we de- 
fined t;,z and 95)z (see Section 2.4). One easily proves that the functions 
S, : N? + Nand Sr : N? > N are primitive recursive, where S; is the 
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function assigning to (a, b,c) the integer HES/Y, Vin ifa = (i,...,i,) is 
the code of a sequence of pairwise distinct natural numbers, b = #t € 
Term and c = (#5),..., #5,), with #s; € Term for any i, and 0 otherwise. 


The function Sf is defined in a similar way: to 
(a, b,c) = (iy, ..., in), FQ, (#51, ..., #FS_)) 


one assigns 


S-(a, b, c) = FQs/v;, seDin - 
In particular, one obtains the following statement: 


Lemma 5.1.2. There exist primitive recursive functions Subst, and Subst ¢ 
in J; such that, for any n, if s and t are terms and ¢ is a formula, then 


Subst,(n, #s, #¢) = #t,,,, and Subst ,(n, #8, #9) = #Qs/p,,- 


Similarly, one encodes formulas from propositional calculus, via F +» 
#F EN, with F € Fml,. 


Lemma 5.1.3. The set Taut of codes of tautologies for the predicate cal- 
culus is primitive recursive. 


Proof. One checks directly that 


« the set Form, = {#F | F € Fml>} is primitive recursive; 


* the function assigning to (d, x) the truth value (0 or 1) of the 
formula F = F[pgo,..., Py_1] for the distribution of truth values 
d = (do,...,dn_1), with d the code of a length n sequence of 
0’s and 1’s, and x = #F the code of F € Fml» containing only 
propositional variables p; with i < n — 1, and 0 otherwise, is 
primitive recursive; 


« the set Tautp = {#F | F is a tautology} is primitive recursive; 


* there exists a primitive recursive function assigning to the 
pair ((#Q,...#n_1), #F) the natural number #F 5 if F = 


F [Dos +» Pn-1]: 
This is enough to get the result. 
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Similarly one checks that the set of codes of the equality axioms is 
primitive recursive and also the set of codes of the quantifier axioms (tak- 
ing into account Lemma§..1.2). We have thus proved the following state- 
ment. 


Theorem 5.1.4. The set Ax of codes #9 of logical axioms 9 of £ is primi- 
tive recursive. 


Coding formal proofs. We encode a finite sequence of £-formulas d = 
(Po> +» Pn—1) by ##d = (H#Go,..., #Pn—1). 


Lemma 5.1.5. The set 
Prf = {(##d, #¢) | dis a formal proof of y} 


is primitive recursive. 


Proof. Given (x, y), one first checks if y € Form, that is, if y = #9 for 
some £-formula ¢. If yes, it suffices to decode and to test whether all 
components of x are codes of £-formulas ¢;, the last one being equal to 
gy, and whether for every i, either g; is a logical axiom (which is a prim- 
itive recursive property by Theorem or it can be obtained using 
deduction rules from previous formulas. 


Proposition 5.1.6. The set U = {#@ | + 9} is recursively enumerable. 


Proof. We haven € U © id (d,n) € Prf. 


5.2. Decidable Theories 


Let T be an £-theory. We denote by Thm(T) = {g £-sentence | T + o} 
the deductive closure of T. 


Definition. Let T be an £-theory. 


(1) T is called recursive if {#@ | p € T} is a recursive set. 


(2) T is called recursively (or effectively) axiomatizable if there ex- 
ists a recursive £-theory T’ such that Thm(T) = Thm(T’). 


(3) T is called decidable if the set #Thm(T) = {#¢ | T F ¢}is 
recursive. 
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If T is an £-theory, one sets 
Prf(T) = {(##d, #¢) | d isa proof of ¢ in T}. 


Lemma 5.2.1. If T is recursive, then the set Prf(T) is recursive. 


Proof. Exercise. 


Theorem 5.2.2. If T is effectively axiomatizable, then #Thm(T) is recur- 
sively enumerable. 


Proof. Let Ty be a recursive theory such that Thm(T) = Thm(To). 


We have g € Thm(T) if and only if g is an £-sentence and there ex- 
ists a proof of g in Ty. One concludes by Lemma§5.1.1(5) in combination 
with Lemma 6.2.1] 


Remark 5.2.3. The converse of Theorem holds, too. 
Proof. Assume the set #Thm(T) is recursively enumerable. There then 
exists a recursive function f € F such that 

#Thm(T) = {f@ = #9; | i € N}. 


The function g which to n assigns # ome gy; is recursive and we have 


g(n) > n for every n. Its image is therefore a recursive set. Since { ee 0 Fi | 
n € N} is an axiomatization of Thm(T), this finishes the proof. 


Theorem 5.2.4. Assume T is effectively axiomatizable and complete. Then 
T is decidable. 


Proof. By Theorem we already know that #Thm(T) is recursively 
enumerable. Its complement N \ #Thm(T) is the union of {#@ | apg € 
Thm(T)} and {x | xis not the code of an £-sentence}. The former set 
is recursively enumerable since #@ » #7¢ is given by a (primitive) 
recursive function, while the latter one is primitive recursive by 


Lemma §5.1.1(5). One concludes by Theorem [.6.4. 
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Example 5.2.5. 


(1) Any finitely axiomatizable theory is effectively axiomatizable. 


(2) ACF,, p prime or 0, is decidable. 
[Indeed, the axiomatization is clearly recursive, and one con- 
cludes by combining Theorem (ACF, is complete) with 


Theorem 6.2.41] 


(3) ACF is decidable. 
[Indeed, as ACF is effectively axiomatizable, it suffices to prove 
that X = {#9 | g isa sentence and ACF F 9} is a recursively 
enumerable set. But one has ACF F gifand only if there exists 
p prime such that ACF, F 79, as follows from the Lefschetz 
principle, which is equivalent to ACF F y, > 79, where x, is 
the formula 1 + ---+1=0. 
™—-——_’ 
p times 

It follows that #g € X if and only if there exists a prime p 
and y such that (y, #(Xp > =g)) € Prf(ACF). Since the map- 
ping (p, #~) +> #(%p > 7¢) is (primitive) recursive, one con- 
cludes that X is recursively enumerable.] 


(4) The theory of the ordered field of real numbers & is decidable 
(this is a result of Tarski). 
[Since it is complete, it suffices by Theorem to give an 
effective axiomatization of Th(R). One can prove that any or- 
dered field (K; +, —, 0,1, -, <) satisfying the intermediate value 
property for polynomial functions (that is, if P(x) € K[X] and 
a,b € K witha < band P(a) - P(b) < 0, there exists c € K 
such that a < c < band P(c) = 0) is elementarily equivalent to 
R. To prove this, one shows that the theory we just described 
admits quantifier elimination in the language of ordered rings, 
for which we refer to [[12]].] 


Exercise 5.2.6. If T is decidable and @p,...,@,_; are £-sentences, then 
T U{Go,--»; Pn_1} is decidable. 
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5.3. Peano Arithmetic 


We shall work in the language of arithmetic £,, = {0, S,+,-,<}. In any 
£Lor-Structure, for any n € N, one defines by induction n, setting 0 = 0 
andn+1 = S(n). 


Definition. 


(1) The set of weak Peano axioms is the finite set PA, consisting of 
the following 8 axioms: 
(A1) Vup 7SUp9 = 0. 
(A2) Vup Jv,(4U9 = 0 > Sv, = Up). 
(A3) VupVv, (SUg = SU, > Up = vj). 
(A4) Vu U9 + 0 = Up. 
(A5) VupVv, Up + SU, = S(Up + V;). 
(A6) Vv Up - 0 = 0. 
(A7) Wp Vv, Up - SU, = (Ug - V1) + Uo. 
(A8) VWup Vv, (Up < LV, & (Av2U2 + Ug = Vy A7Ug = Uj)). 
(2) The set of Peano axioms is the infinite set PA consisting of PAg, 
and for each £g,-formula ¢(Up, ..., U,) of the following axiom 
of induction (where U = (0j,...,U,,)): 


Vv, Wv2 +++ WUn([P(O, v) A VUp(P(Up, Vv) > G(Sup, v))] > Vp (Uo, V)). 
Remark 5.3.1. 


(1) One has %, := (N50, succ, +, -,<) F PA. 


(2) PAg is an expansion by definition of the £4, \ {<}-theory (A1)- 
(A7). (This is a consequence of (A8).) 


(3) PAg isa quite weak theory. One may construct models in which 
addition (or multiplication) is not commutative or associative; 
also, there are models in which < does not define a total order. 


(4) An element in a model of PAg is called non-standard if it is 
different from the interpretation of n for any n € N. By com- 
pactness, there exist models of PA (and thus of PA) containing 
non-standard elements. 


Definition. Let M@ C M’ be L,,-structures. One says that M is an initial 
segment of IN’ if the following two conditions hold: 
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¢ For all a’ © M’ and alla € M such that M’ EF a’ < aone has 
a’ eM. 
* Foralla’ € M’\ Mandalla €M one has M’ Fa<a’. 


Lemma 5.3.2. Let MF PAg. Then N = {n™ | n € N} is the base set of a 
substructure of M which is isomorphic to N,_ and an initial segment of M. 


Proof. Let % F PAg. One proves: 


(i) For any m,n € None has PAyg Fm+n=m+n. 
This is proved by (naive) induction on n € N, using (A4) 
and (A5). 
(ii) For any m,n € N one has PAg Fm-n=m-n. 
This is proved by induction on n, using (A6), (A7) and (i). 
(iii) PAg F VxVy(x < y @ Sx < Sy). 
Indeed, if a, b,c € M, one has the following equivalences: 


(A3) 
MEct+a=bAr7a=b — ME S(c+a) = SbA-7ASa = Sb 


(A5) 
—= MEc+Sa=SbA-7Sa= Sb. 


The statement thus follows from (A8). 


(iv) For anyn € None has PAy F Vx(x <no a x =i). 

This is proved by induction on n. If % F c < 0 for some 
c EM, then (in M) a+c = O for some a € M andc # 0 by (A8), 
hence c = Sd for some d by (A2), and by (A5) one infers that 
0 =a+Sd = S(a +d), which contradicts (Al). This proves 
the case n = 0. For the induction step n + n +1, one uses (iii) 
together with the fact that 0 < c for any c 4 0 (a consequence 
of (A4) and (A8)). 

(v) Letc€M\NandneN. Then MEn <c. 

This is proved by induction on n, the case n = 0 being 
clear. For the induction step n + n +1, note that c = Sd for 
some d ¢ N. Hence M F n < d, which proves MF n+1<c, 
using (iii). 


This concludes the proof. 
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Up to naming elements of N by constants (which is irrelevant since 
any element of N is the interpretation of a term), we have in particular 
proved that if M F PAg, then M F A(Xg¢). 


Definition. 


(1) The set of 2, -formulas is the smallest set of £,,-formulas con- 
taining the quantifier-free formulas and which is stable under 
A, V, existential quantification 4x and bounded universal quan- 
tification (Vx < t), with t a term not depending on the variable 
x. Here, (Vx < t) gis defined as Vx(x < t > ¢). 


(2) The set of strict 2, -formulas is the smallest set of £,,-formulas 
containing the formulas0 = x,Sx =y,x+y=2,x-y=zZ, 
xX = y,7xX = y,x < y, 7x < yand which is stable under A, v, 
dx and, assuming that x and y are two distinct variables, under 
Vx <y. 


Lemma 5.3.3. 


(1) Any 2,-formula is equivalent to a strict X-formula. 


(2) If visa X,-formula, then any formula 95,z obtained by substitu- 
tion is a X,-formula, too. 


Proof. Proceeding as in the proof of Lemma 8.3.1] one may eliminate 
complex terms by using existential quantifiers. For instance, the formula 
x- Sy = Ois equivalent to dudu(Sy = uAx-u=vUAO0= v). The second 
statement is clear. 


Proposition 5.3.4. Any X,-sentence satisfied in N,, is a theorem of PAo. 


Proof. By Lemma it is enough to prove that for any strict Dy- 
formula ¢(x1,...,xX,) and any m),...,m, € N we have 


Nop F p[m,..., Mn] > PAg F g(m, as My). 


We shall prove this by induction on the height of the formula starting 
from the case when ¢ is a quantifier-free formula, which follows from 
Lemma 5.3.2. 

Assume now that the statement holds for g(x ,...,x,) and for 
W(Xo,...,X,). Clearly it then also holds for gp A p~ and yg v py. We now 
prove that it holds for AxqQ(Xo,...,X,). If Moe F Axqg[m),...,m,], then 
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there exists mj) € N such that N,, F g[mo,...,m,], which implies PAg F 
y(mMp, ..., Mp,) by the induction hypothesis; hence PAg F Axp¢(mj,..., My). 
Finally, if N F (Wx9 < x,)o[m,...,m,], then by definition of 
VxX9 < x, one has Ny F glmo,m,...,m,] for any my < my. By the 
induction hypothesis, we then have PAg F ¢(mp,...,m,) for any mp < 
m,. By (iv) in the proof of Lemma one deduces PAg F (Vx < 
X1)P(M, ... Mn). 


Definition. 


(1) Let f € F. One says that the formula g(x, ..., Xp41) represents 
f if for any n,..., ny, € N one has 


PAg FE Vy (o(m, Mp Y) OY= FM, --Mp)). 


(2) Let A C NP. One says that g(x),..., xp) represents A if for any 
Ny, Np E N one has 


n € A= PAoF 9(ny,..., Np) 
and 


néAz=> PAE =9(N,, seule 


Remark. If 9(x;,..., Xp) represents A, then 1, is represented by the for- 
mula (9(X) A Xp41 = 1) V (79(X) A Xp41 = 0). Conversely, if px, Xp41) 
represents 14, then ?(x, 1) represents A. 


Theorem 5.3.5 (Representability Theorem). Every total recursive func- 
tion is represented by a X,-formula. 


Proof. Let S be the set of total functions that are represented by a 2)- 
formula. By Theorem it is enough to establish the following three 
facts: 

(i) The basic functions S, P”, i. +,-,1. and 1, are in S. 

(ii) S is stable under composition. 


(iii) S is stable under the total -operator. 
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Item (i) is clear, since the basic functions are all represented by quantifier- 
free formulas (which are 2, by definition). 

If fi.--» fp © Fm are represented by the formulas ¢),...,@p, and if 
g & F, is represented by y, then h = g(f.,..., fp) © Fm is represented by 


p 
Fp, Wp [\ GG) A POD = YpoXmat): 
i=1 
In particular, this proves (ii). 

Finally, let f € 441 € S, and let g € F, be the function obtained 
from f by the total y-operator, that is, g(x) = uy (f(x,y) = 0). Let 
A = {@, y) | fy) = 0} C NPt. Applying (i) and (ii), we get 14 = 
Axy.1_(f(x, y),0) € S. Choose a Z,-formula 9(x, y, z) representing 1,4. 
Then the function g is represented by the 2,-formula ~(x, y) given by 
p(x, y,1) A (Vy’ < y) p(x, y’, 0). This proves (iii). 


Corollary 5.3.6. A subset A C N” is recursively enumerable if and only if 
there exists a X,-formula defining A in the structure Ngq. 


Proof. Any subset defined by a quantifier-free formula is (primitive) re- 
cursive, hence a subset defined by a 2,-formula is recursively enumer- 
able by the closure properties of recursively enumerable sets, their stabil- 
ity under existential quantification (that is, under projection), bounded 
universal quantification, intersection and union (Proposition f-.6.1). 


Conversely, since any recursively enumerable set is the projection of 
a recursive set, it is enough to prove that any recursive set is defined by 
a 2,-formula, which follows from the Representability Theorem. 


Proposition 5.3.7. Let IN be a model of PA. Then the following properties 
are satisfied in M: 


(1) + and - are commutative and associative. 
(2) - is distributive with respect to +. 


(3) < defines a total order, compatible with + (ifa < b, thena+c< 
b+ c) and with multiplication with a non-zero element (ifa < b 
and c # 0, thena-c < b-c); 0 is the minimal element for this 
order, and for any a, Sa is the immediate successor of a. 
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(4) Any element is regular for addition (a+c = b+c => a= b), and 
any non-zero element is regular for multiplication (ifa-c = b-c 
and c # 0, thena = b). 


Proof. All these properties are easily proved using the induction axioms 
of PA. As examples, let us provide the arguments for associativity and 
commutativity of addition. The proof of the remaining properties is sim- 
ilar. 


Associativity of addition. 


Let (x, y, z) be the formula x +(y+z) = (x+y) +z. In M, we have 
a+(b+0)=a+b= (a+b) +0 by (A4), and hence M F Vx, y p(x, y, 0). 
If a,b,c € M verify (a+b) +c=a+(b+c), then 


a+(b+Sc)=a+S(b+c) = S(a+(b+c)) 
= S((a+ b) +c) =(a+b)+Sc 
by (A5), whence IN F Vx, y,z(~(x, y,Zz) > (x,y, Sz)). The induction 
axiom for g implies that IN F Vx, y, z p(x, y, Z). 
Commutativity of addition. 


One checks the following properties: 
(i) MF VxO+ x = x (by induction, using (A4) and (A5)). 
(ii) MF Vx 1+ x = Sx (by induction, using (i) and (A5)). 


(iii) MF Vx,yx+y =y+x (by induction, using (i), (ii) and asso- 
ciativity of addition). 


Remark. By Proposition PA suffices for the purposes of basic 
number theory. However, as we will see later in this chapter, PA is not a 
complete theory. Concrete mathematical statements independent from 
PA are not so easy to provide, but it is known that for instance Good- 
stein’s theorem from Exercise is such an example. 


Lemma 5.3.8 (Overspill). Let % be a non-standard model of PA (that 
is, a model containing a non-standard element) and let p(x) be an £Lo,- 
formula. If M@ F en) for every n € N, then there exists c € M non- 
standard such that M F yfc]. 
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Proof. Otherwise, g would be satisfied precisely by the standard ele- 
ments, yielding IN F y(0) and M F Vx(P(x) > —P(Sx)). But then M F 
Vx g(x), which is a contradiction since M contains non-standard ele- 
ments by hypothesis. 


5.4. The Theorems of Tarski and Church 


We fix a primitive recursive function subst € % such that if p(v) 
is any £,,-formula with one free variable v and n € N, then one has 
subst(#9, n) = #¢(n). Note that such a function subst clearly exists. 


By the Representability Theorem, we may choose a %,-formula 
G(x, y,z) representing subst. Given a formula g(v) with one free 
variable v, we consider the formula 


Hy(x) := 4z (G(x, x,z) A(z), 


and we set Ng := #7Hg and Ag := “He (Ng). 


Proposition 5.4.1 (The Diagonal Argument). Let p(v) be a formula with 
one free variable, in the language £4,. Then 


PAg F 9(#Ag)  7Ag. 
Proof. Let 9% be a model of PAy. Since by the above definitions 
subst(ng, Ng) = subst(#7Hg, Ng) = #Ag, we have 
(*) ME VZ (cing, Ng, Z) ez= Hy), 
and thus a _ 
Mk gH Ag) & Mk Az (Gly. Ng.2) A P(e) & MF Wy 


Ho (ng) 


by (4) and by the definition of Ag, respectively. 


Corollary 5.4.2 (Fixed Point Theorem). Let p(v) be an £q,-formula with 
one free variable. Then there exists an £q,-sentence tb such that 


PAg k o(#p) © p 


Proof. One may take A_, as ¢ and use the Diagonal Argument. 
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Theorem 5.4.3 (Tarski’s Theorem on the Non-definability of Truth). Let 
MF PAg. Then there is no £q,-formula S(v) with one free variable such 
that for any Lq,-sentence w one has NF p if and only if M F S(#y). 


Proof. Assume S is such a formula. By Proposition §.4.1]one has PAg F 
S(#Ag) @ 7As. In particular, M@ F S(#Ags) — Ag, which is absurd in 
view of the property satisfied by S. 


Corollary 5.4.4. There is no £4,-formula Sy_,,(v) such that for any £g,- 
sentence ~ one has N F p ifand only if Ny F Sy. (#4). In particular, 
Th(M,;) is undecidable. 


Proof. The first part is a special case of Theorem[.4.3. Assume now, for 
contradiction, that T = Th(%,;) is decidable. This means that #Thm(T) 
is a recursive set and thus representable by a X,-formula g(v) by Theo- 
rem §.3.5. For any £,,-sentence ~ we then have %,, F ~ if and only if 
N, F p(#y), contradicting the first part. 


Theorem 5.4.5 (Church’s Theorem). Let T D PAg be a consistent £g,- 
theory. Then T is undecidable. 


Proof. Otherwise, #Thm(T) would be a recursive set, and there 
would exist, by the Representability Theorem, a formula t(v) represent- 
ing #Thm(T). Applying Proposition 6.4.1] we would get 


TEA, @ #A, € #Thm(T) @ PAy F 1(#A,) @ PAy F 7A, 


which is absurd, since by assumption T is a consistent theory with T D 
PAg. 


Church’s Theorem yields undecidability of the Predicate Calculus: 


Corollary 5.4.6 (Church). There exists a finite language £ such that U = 
{#9 | p is a universally valid £L-formula} is not recursive. 


Proof. Let £ = £,,. Assume U is recursive. Then, as a formula g 
is universally valid if and only if a universal closure of ¢ is, the empty 
Lay-theory is decidable. Since PAg is a finite theory, PAg would then 
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be decidable by Exercise contradicting the Theorem of Church. 
Indeed, 
8 
p €Thm(PAy) @ PAg pet /\ Aj > 9 © oy € Thm(). 
i=1 
Yo 

Remark. One can prove that the corollary already holds for £ consist- 
ing only of a binary relation symbol, that is, £,,, = {€}. Examples of 
simpler languages for which this no longer holds are discussed in Exer- 


cise 5.8.3). 


5.5. Gédel’s First Incompleteness Theorem 


From Church’s Theorem we may now derive Gédel’s First Incomplete- 
ness Theorem, which Gédel had proved prior to Church’s work. 


Theorem 5.5.1 (Goédel’s First Incompleteness Theorem). Let T be a con- 
sistent and recursive £q,-theory such that T D> PAg. Then T is not com- 
plete. 


Proof. If T were complete, it would be decidable by Theorem con- 
tradicting Church’s Theorem. 


In the remainder of this section, we shall keep the hypotheses of 
Theorem namely T D PAg is consistent and recursive. We will 
now present an improvement of Theorem due to Rosser, providing 
an explicit sentence ~ such that neither TF ~ nor TF —). 


We recall that the set 
Prf(T) = {(#¢, ##d) | d is a formal proof of gy in T} 
is recursive. By the Representability Theorem, there exists a 2,-formula 
P(x, y) representing Prf(T) C N?. 


Let Neg : N —> N bea primitive recursive function such that 
Neg(#¢) = #79 and Neg(x) = 0 if x is not the code of a formula. Fix a 
&,-formula v(x, y) representing Neg. We set 


PR(x,y) := Pry) A(z < y)Su(Pr(u, z) A v(x, u)). 
Let hE(x) = IyPR(x, y), and AR = Apk- 
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Theorem 5.5.2 (Rosser’s Variant of the First Incompleteness Theorem). 
Let T D PAg be consistent and recursive. Then T K AR. and T K AAR. 


Proof. We set A := A% andm := #A. By the Diagonal Argument 
(Proposition 5.4.1), we have 


(+) PAy F AyPR(m, y) @ 7A. 


Assume T F A. Then there exists a proof of A in T, thus there exists 
p € N such that (m, p) € Prf(T), whence PA, - P,(m, p). Since T is 
consistent, for any p’ € N we have PAg F 7du (v(m, u) A Pru, p’)). It 
is enough to note that PAyg Fk (x < po a x = i) to deduce that 
PAy F PR(m, p). By (fi), we have PAy F 7A, and hence T + WA (since 
T D PAg). This is a contradiction. 

Assume T + 7A. Then there exists a natural number p such that 
PAy - Pr(#7A, p). Since T is consistent, for any p’ € N we have PAg F 
=P,(m, p’). It follows that PAg F a3yPR(m, y), since %,, is an initial 
segment of any model of PAy by Lemma[5.3.2, By (ff), we have PAy F A, 
and hence T - A, which is a contradiction. 


Remark. Let T D> PAg be a consistant and recursive £,,-theory. Let 
hy(x) := AyPr(x, y) and Ap := A). Then T ¥ Ar. 


Proof. The proof is similar to that of Theorem and left as an exer- 
cise. 


5.6. Definability of Satisfiability for =,-formulas 


By Tarski’s result (Theorem |5.4.3)), there is no formula expressing satis- 
fiability for all arithmetical sentences. However, if one restricts to X,- 
sentences, one has the following representability result: 


Proposition 5.6.1. There exists a 2,-formula Sats, (v) such that for any 
x,-sentence ~ one has 


PAF 9 @ Sats, (#¢). 
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Before we give a proof the proposition, we prove a result which guar- 
antees that various arguments involving coding may be carried out in 
any model of PA. 


Lemma 5.6.2. Let f € F,, bea primitive recursive function. Then there is 
aX,-formula yf (x1 ..., Xn, y) representing f such that 


PAF VX),..,XnA! Xp. 


Proof. We say that function f € F is provably total x, if the conclusion 
of the lemma holds for f. We will only sketch the argument and leave 
the (rather tedious) details to the reader. 


Claim 1. Gédel’s B-function B € F; is provably total dy. 


Recall that 6(x,, x2,x3) = u¢z[zZ =x, mod (x,(x3 +1) +1)]. Onechecks 
that the formula 7g(x;, x2, x3, y) given by 


(*) y <X2(xX3 +1) 4+ 1A dUx, = Xp + V((X3 +1) +1) 


is a X,-formula which represents 6 and defines a total function in any 
model of PA. This proves Claim 1. 


Claim 2. Let M be a model of PA, y(x, y) an Lq,-formula with parameters 
from M andi € M such that M F (Vj < iA! yy(j, y). Then there are 
a,b € M such that MF (Wj < i)Vy(ye(a, b, j,y) = YU. y)), with xg asin 
(). 

This holds in N,, by Lemma f.7.2(5). If it did not hold in M, there 
would be a minimal ig where it fails. The proof that if it holds for ig — 
1, then it holds for ig only uses basic number theory and may thus be 
carried out in % F PA. Thus, a minimal counter-example cannot exist, 
proving Claim 2. 

Claim 3. The set of provably total X,-functions contains the basic functions 
(S, B” and C$), and it is stable under composition and recursion. 


The statements about the basic functions and composition are clear. 
To prove stability under recursion, suppose f € F,,, is defined by recur- 
sion using g € F, andh € F,,, and that p(x),...,X,, y) is a 2,-formula 
representing g which defines a total function in any model of PA, and 
similarly 0(x),..., X42, y) for h. Using Claim 2, one may then construct 
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ad -formula 9(x),...,Xy41, y) representing f which defines a total func- 
tion in any model of PA, by coding the computation of f as in the proof 


of Theorem 


Remark 5.6.3. Consider the expansion by definitions of PA given by 
adding new symbols for the functions defined by the formulas y, as in 
Lemma By the proofs of Lemma and Proposition not- 
ing in addition that the formula +(x, y) is equivalent in PA to the 2,- 
formula 4z(yf(x,z) A y # 2), it follows from Lemma that every 
&,-formula in the expansion is equivalent to a 2,-formula in £,,. Be- 
low, we will construct X,-formulas through this process. Abusing nota- 
tion, we will use the same letter f for the function symbol and for the 
corresponding primitive recursive function. 


We will in particular use the binary function (x); to decode sequences 
in 9% F PA which are of ‘finite’ length in the sense of M. 


Proof of Proposition By Lemma any 2 -formula ¢ is logi- 
cally equivalent to a strict X,-formula. It follows from the proof of that 
lemma that there exists a primitive recursive function f such that, for 
any 2-formula 9, f(#¢) is the Godel number of a strict 2, -formula log- 
ically equivalent to g. It is thus enough to prove the existence of a X- 
formula S’(v) such that for any strict X,-sentence g, one has PAF 9 © 
S'(#9). 

By a certificate of a strict 2,-formula gy we shall mean a finite 
sequence (¢},...,9,) of strict 2,-formulas with g, equal to g and a 
finite sequence (s",..., s”") of finite sequences of natural numbers s” = 
(SG, «+5 Sa(m)-) such that the following conditions hold: 


(i) All free variables of g,, belong to Uo, ..., Ve(m)-1- 
(ii) If g,, is the formula 0 = v; then 0 = sj". 


(iii) If ~, is the formula S(v;) = yj then 7” + 1 = s;". 
(iv) If%m is the formula v; + vj = v;, then sf" + 5" = sj. 
ry = Ske. 


(vi) If is the formula v; = uy; then sj" = s;". 


(v) If@m is the formula v; - uj = vy then s 


(vii) If is the formula 7v; = u; then sj” # 57". 


(viii) If @ is the formula uv; < uv; then sj" < 5". 
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(ix) If is the formula 7; < vu; then sj" < 57". 


(x) If g, is the formula g’ A y” then dm’, m" <m such that 9’ 
equals ¢, p” equals gy,” ands” =s™ =s5s!™ . 

(xi) If ¢,, is the formula gy’ Vv g” then dm’ < m such that ¢’ equals 
Pm and s™ = s™ or am" < msuch that gy” equals gy,” and 
sm = gm" 

(xii) If gy, is the formula 4u;¢' then 4m’ < m such that g’ equals 
Pm and se = sf", Wk < min(a(m), a(m’')), k # i. 

(xiii) If, is the formula (Vu; < u;) 9’ then Va < si" dm’ <msuch 


that g’ equals g,,/, s” = a and such that lg = sf’, Vk < 
min(a(m), a(m’)), k # i. 


This notion of certificate makes sense in any model §% of PA and one 
has 


ME “phasa certificate” if and only if MF ¢g. 


This is checked by induction on #¢@ in the model M (it is crucial here 
that § is a model of PA and not only of PAg). 


Furthermore, one may express the existence of a certificate for g in 
IM by aX, -formula in the Géddel number of gy. More precisely, there exists 
a 2,-formula S’(v), such that in any model M of PA, for any code #9 of 
a ‘strict X,-formula’ (even non-standard), IN F S’(#¢) if and only if MF 
“g has a certificate”. One constructs S’(v) by coding conditions (i)-(xiii) 
that are clearly expressed by 2,-conditions. The equivalence between 
MF S'(#—) and M F “~p has a certificate” is checked by induction on #¢ 
in the model MN. Here again it is important that IM is a model of PA. 


5.7. Gédel’s Second Incompleteness Theorem 


Let T be a recursive £,,-theory containing PA. To state and to prove the 
Second Incompleteness Theorem, we will explicitly construct a specific 
formula expressing that a sentence is provable. 


We consider ~,-formulas Sen(v), Ax;(v), MP(U9, U;, U2) as well as 
Gen(up, U,) representing respectively the set of Gddel numbers of sen- 
tences, the union of the sets of Gddel numbers of logical axioms and of 
those of T, the set of triples (#¢, #~), #6) with 9, p and 6 formulas such 
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that 6 is obtained from g and p by modus ponens, and the set of pairs 
(#9, #~) with g and » such that > is obtained from ¢ by generalization. 


We shall also use Gédel’s 6-function to code sequences. We consider 
the following formula B(a, b, n): 


Vi < n[ Axr(8(a, b, i)) v (Aj, k < i) MP(B(a, b,j), B(a, b, k), B(a, b, i) 
v (Aj < ) Gen(B(a, b, j), B(a,b,i))]. 
The formula Prr(m) given by 
Sen(m) A da, b, n(6(a, b,n) = m A B(a, bn + 1)) 


expresses that m is the Gédel number of a sentence having a formal proof 
in T. Ifgisasentence, T| gif and only if N,, F Pr;(#¢). In particular, 
if p isa Z,-sentence, N,, Fp => Ney F Pr(#). In order for this 
property to hold in any model of PA, we are led to consider the following 
variants of B and Prr. 


One defines the formula B(a, b, n) by 


Vi < n[ Axr(6(a, b,i)) V (Aj, k < i) MP(6(a, b, j), B(a, b, k), B(a, b, i) 
V (Aj < i) Gen(6(a, b, j), B(a, b, i) V Saty, (B(a, b, i)| 
and Pr;(m) by 
Sen(m) A da, b, n(B(a, b,n) = m A B(a,b,n + 1)). 
Finally, if g is a sentence we write Lr¢ for Pr7(#¢). 
By construction the following properties hold: 7 
Proposition 5.7.1. 
(1) If g is a X,-sentence, 
PAF ¢ > [Ir¢@. 
(2) For any sentence p, 


Ne FOry — Try. 
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Proof. Let ¢ be a X,-sentence and let §% be a model of PA. By Propo- 
sition if M F ¢g, it follows that Mm F Saty, (#¢), and hence also 
MF Lir¢y. This proves (1). 

Note that if g is a X,-sentence which is true in %,;, then TF @. Part 
(2) thus follows from the construction of Pr;(m). 


Proposition 5.7.2. Let y and p be sentences. The operator (|r satisfies 
the following properties (Loeb’s axioms): 

(Ll) Tko => THOrg. 

(L2) TF (Ore AO ry > ¥)) > Ory. 

(L3) TF Ore > Orie. 


Proof. (L1) By Proposition .7.1(2), if T + 9, then ®,, F Org. Since Prr 
is Xy, it follows that _]7¢ is satisfied in any model of PAg, soin particular 
in any model of T. 
(L2) Arguing in an arbitrary model of PA, it follows from modus 
ponens that PAF (rg A Lir(¢ > )) > Ory, which implies (L2). 
(L3) is a special case of (1) in Proposition since [|r¢ is a y- 
sentence and T D PA. 


Corollary 5.7.3. Let ¢ and wp be sentences. [fT + g > », thenT + 
Ore > Ory. 


Proof. Assume T | ¢ > w. Then, by (L1), T+ L1r(¢ — ¥). It follows 
thatT ' Ore - (Ore A Or(e — )). By (L2) one deduces that 
TF Ore > Ory. 


Theorem 5.7.4. Let T be a recursive £,,-theory containing PA. Let p be 
a sentence. Then 


TF OrOre > 9) > Ore. 


Proof. There exists a formula F(x) such that for any sentence 0, F(#@) 
is equal to the sentence []7@ —> g. By Corollary there exists a 
sentence jp such that 


(*) TEP (Ord > 9). 
In particular, 


TEP Ord > 9); 
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hence, by Corollary 
TFEOre > Orr? > ¢). 
It follows from (L3) that 
TEOrY — OrOre A OrOry > ¢9)). 
Since by (L2) 
TE OrOre A Orr? > ¢)) > Ore, 
one deduces that 
(*) TE Or? > Ore. 
From (+) one gets that 
TE (re > ) > Or? > 9), 
thus by (+) it follows that 
TE (Ore > 9) > ¥. 
By Corollary one infers that 
TF OrOre > ¢) > Ory, 
and using (**) one deduces that 


Tt Or(Ore > 9) > Ore, 
which ends the proof. 


Corollary 5.7.5. [fT + Lrg > 9, then T+ L1r¢. 


Proof. Indeed, if T + Tre — 9, then it follows from (L1) that T F 
Clr(Or¢ — 9), so we may conclude by Theorem 


Since [Jrg — ¢ is logically equivalent to -_|r¢ V ¢, we obtain the 
following corollary. 


Corollary 5.7.6. If T + ~[]r¢, then T + Lir¢. 


This corollary implies in particular the following famous result. 


Theorem 5.7.7 (Gédel’s Second Incompleteness Theorem). Let T be a 
recursive £q,-theory containing PA. Then if T is consistent, for any sen- 
tence p, T K =(]r¢. In particular, let Cony = =[])(0 = 1), the sentence 
expressing the consistency of T. Then T K Cont. 
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5.8. Exercises 


Exercise 5.8.1 (Presburger arithmetic). We consider the language 
Lpres = {0,4,<,1,=,,0 > 1}, where =, is a binary relation for all n. 
Presburger arithmetic is the Lp;e,-theory Tpye, which is given by the fol- 
lowing axioms: 

+ the axioms of ordered abelian groups (cf. Exercise B.7.9); 

* an axiom stating that 1 is the smallest positive element; 


* for any n > 1, an axiom 9g, of the form 
Vx, V(X =, yo Azx+nz=y) 


and an axiom y,, of the form 


vere Leet) -+1. 


i times 


(1) Observe that 2 = (Z;0,1,+,<,=,) F Tpres, where all symbols 
have their usual interpretation in the integers. 


(2) Prove that Tp,e, eliminates quantifiers and is complete. 
(3) Deduce that Tpye, is decidable. 
Exercise 5.8.2. 
(1) Let ® = {#¢@ | vis asatisfiable £,,-sentence}. Prove that ® is 
not recursively enumerable. 


(2) Let ®,, be the set of codes #y of £,,-sentences ¢ satisfiable 
by an £4,-structure of domain {0,...,m — 1}, with m > 1 an 
integer. Prove that ®,, is primitive recursive. 


(3) One considers the set ®,;,, of codes #y of £,,-sentences ¢ sat- 
isfiable by a finite £,,-structure. Using the previous question 
and an appropriate coding, prove that ¢;, is recursively enu- 
merable. 


Exercise 5.8.3. Let £ = {P,c}, with P a unary predicate and c aconstant. 


(1) Determine all countable £-structures up to isomorphism. 


(2) Deduce that two £-structures M and 9 are elementarily equiv- 
alent precisely if the following two conditions hold: 
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¢ IMF Pc if and only if N F Pc, and 
- m & 3**x Qx if and only if N F 42*x Qx, for any k EN 
and any Q € {P, “P}. 


(3) Prove that an £-sentence ¢ is universally valid if and only if 
Im F @ for any finite £-structure. Deduce that the empty L- 
theory is decidable. 


Exercise 5.8.4. The aim of this exercise is to prove that there is a to- 
tal recursive function which is not provably total Z, (see p. for the 
definition). 


(1) Prove that there is a partial recursive function h € F," with the 
following properties: 
- ifa = #¢@ for a X,-formula ¢(Up, v,) and ifn € Nis such 
that there exists m € N with PA + g(n,m), then PA + 
9 (n, h(a,n)); 
¢ ifa = #9 for a X,-formula ¢(U9, v,) and if n € Nis such 
that there does not exist m € N with PAF ¢(n,m), then 
(a,n) € dom(h); 
¢ otherwise, h(a,n) = 0. 


(2) Choose h € F;* as above, and define g € F, as follows: 
- ifa = #9 for a Z,-formula g(Up, v,) and if b = ##d fora 
formal proof d of VugA! ¥; P(Up, Vv, ) in PA, then g(a, b,n) = 

h(a, n); 

« otherwise g(a, b,n) = 0. 

Prove that g is total recursive, and that it is universal prov- 
ably total X, in the following sense: a function f € F is prov- 
ably total 2, if and only if there are a,b € N such that f = 
An.g(a, b, n). 


(3) Conclude. 


Exercise 5.8.5 (End Extensions in Peano Arithmetic). In this exercise, 
we shall use the Omitting Types Theorem as proved in Exercise 2.7.3} 
The aim is to prove the following result, due to MacDowell and Specker: 


Let IN be a countable model of PA. Then there exists a proper elementary 


extension N > IN such that N is an end extension of M: foranym € M 
and everyn € N \ Mwe have 2 Fm <n. 
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(1) Let §@ F PA. Prove that the pigeonhole principle holds in Mm: 
for any £4,(M)-formula 6(v, z) and any a € M we have 


ME [Vx(Az > x)(Av < a)O(v, z)] > (dv < a)Vx(Az > x)@(v, z). 


(2) Let M F PA. Letcbeaconstant symbol which is not in £,,(M), 
and set L = L,,(M)U{c}. We now consider the £-theory T := 
D(QM)U{c > m | m € M}, where D(M) is the complete diagram 
of M (cf. Section B.2). 

(a) Check that T is consistent. 

(b) Leta € M and let O(v, z) be an £-formula such that T + 
Vu(@(v, c) > v < a) and such that T U {4v@(v, c)} is con- 
sistent. 

Prove that there exists m € M with m < a and such that 
mE Vx(Az > x)O(m, z). 

(c) Leta € M beanon-standard element. Consider the set of 

formulas 


Tq(v) :={v<abtufusm|meEeM}. 
Prove that z, is a non-isolated partial 1-type in T. 


(3) Conclude. 


Exercise 5.8.6 (Tenenbaum’s Theorem). Let IN be a non-standard model 
of PA and let n(x, y) be an £,,-formula with two free variables. Denote 
by S,(M) the set of A C N such that there is a € M with 


A={nEN|ME n(n, a}. 


Denote by S({M) the union of all S,,(%), where 7 runs over the set of 
£qr-formulas with two free variables. 


(1) Let no(x, y) be an £,,-formula such that for any pair of disjoint 
finite subsets A and B of N, the sentence 


ax ( /)\ x) A [\ >70(. ») 
icA jeEB 


is provable in PA. Prove that S,,,(M) = S(M). 
[Hint: Use overspill in M.] 
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(2) Prove that there is a 2,-formula 7, with two free variables such 
that for all n € N the sentence 


No(n, x) + Ay(a(n) - y = x) 


is provable in PA. Here, z(n) denotes the (n+ 1)th prime num- 
ber. Prove that S,,,(%) = S(M). 


(3) Let A and B be two disjoint recursively enumerable subsets of 

N. 

(a) The set of Aj-formulas is defined as the smallest set of 

£ar-formulas containing the atomic formulas and which 
is stable under A, = and under bounded quantification 
(Ax < t) and (Vx < t), with t a term not depending on 
the variable x. 
Observe that there are Ap-formulas a(x, y) and f(x, y) 
such that in M,,, A is defined by dya(x,y) and B by 
AyB(x, y). 

(b) Prove that for any k € N, 


ME (Wx, yz < k) (a(x, y) A B(x, z)), 
and that there is ¢ € M non-standard such that 


ME (Vx, y,Z < 6) (a(x, y) A B(x, Z)). 


(c) Consider A and B as in Exercise (that is, infinite and 
recursively inseparable) to deduce that S({) contains a 
non-recursive set. 


(4) If.M is countable and h : N > Misa bijection, one may trans- 
port the £,-structure of M via h~! on N, defining x +’ y := 
h7)(h(x) + h(y)), x -' y := ho} (h(x) - h(y)), ete. 

We now assume that the structure §N is recursive, that is, 
there exists a bijection h as above such that +’ and -’ are recur- 
sive functions. 

(a) For any fixed integer c € N, prove that the function f : 
N? — N given by f(n,m) = 1ifm+’---+’m = cand 
7(n) times 

f(n, m) = 0 otherwise, is recursive. 

(b) Deduce from this that S(t) does only contain recursive 
sets. 
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(c) Deduce Tenenbaum’s Theorem: 
There is no recursive non-standard model of PA. 
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Chapter 6 


Axiomatic Set Theory 


Introduction 


In this final chapter, we will formalize set theory within the first-order 
logic framework we have developed in former chapters. The language 
of set theory £. has only one non-logical symbol, the binary relation 
symbol €. We start by discussing in detail the Zermelo-Fraenkel axioms 
and the Axiom of Choice. In 6.3] we are finally in a position to prove the 
equivalence of the Axiom of Choice, of Zorn’s Lemma and the existence 
of well-orderings. 


Section 6.4jis devoted to the Axiom of Foundation and its connection 
with the von Neumann hierarchy. We are then able to prove some in- 
dependence and relative consistency results in 6.5, like the relative con- 
sistency of the Axiom of Foundation or the relative consistency of the 
negation of the existence of inaccessible cardinals. In the final section 
we discuss a few famous independence and relative consistency results 
whose proofs are outside the scope of this book, like the independence 
of the continuum hypothesis. 


6.1. The Framework 


We will work in the language £,., and we will consider £--structures 
U = (U; €) (where U will be called a universe of sets). 
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We call a set any element of U. The universe U itself is a set in the 
naive sense. The membership relation between two sets is given by the 
relation symbol €. 


In this way we may form the language £_ 1 (aset in the naive sense). 


Definition. A naive subset D of U is a class if there is an £< y-formula 
g(x) such that D = ¢[ 2]. 


Note that any set a defines a class Cy, which is given by the formula 
g(x) equal to x € a. By the Extensionality Axiom (which is not yet 
defined!), two sets a and b are equal if and only if Cy = C). 


By abuse of notation, we sometimes use € to denote membership 
(in the naive sense) in a class, writing c € D instead of Uf F g[c], where 
D = 9[U]. This should not create any confusion. 


6.2. The Zermelo-Fraenkel Axioms 


Definition. 


(1) The Zermelo-Fraenkel axiom system, denoted by ZF, is given by 
the following list of axioms: Extensionality, Comprehension 
Scheme, Pairing, Union, Power Set, Replacement Scheme, In- 
finity, Foundation. 


(2) If one adds the the Axiom of Choice (AC) to ZF, one obtains 
the axiom system ZFC. 


Remark. Some authors do not include the Axiom of Foundation in the 
axiom system ZF. 


We will now discuss the axioms in detail. 
Extensionality Axiom. Vx, y(Vz(zExozey)>x=y). 
It expresses that two sets having the same elements are equal. 


Notation. We write x C y as an abbreviation for Vz(z € x > z Ey). 


The Extensionality Axiom is then equivalent to the following sen- 
tence: Vx,y(x CyAVOxX>xX=Yy). 
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Comprehension Axiom Scheme. For any formula 9(Ug, ...,v,,) in the 
language £¢, one puts an axiom of the form 


W015 +5 Un41dUn42VU9 (Up E Unga (Uo E Ung A P(U9, +s Un)))- 


It expresses that if a is a set and D is a class, then the class {b € a | 
b € D} is given by a set. We will sometimes denote this set by an D. 


Here are some easy consequences of the axioms stated so far: 


« If a and b are two sets, then one may form their intersection 
anb={c€a|c € b}and, ina similar way, their difference 
a\b={cEea|c¢b}. 

* One obtains the existence of the empty set, denoted by 9. In- 
deed, let a be an arbitrary set. Then @ = {c € a| -c = c}. The 
uniqueness of § follows from Extensionality. 


- Ifa # @ is a set, one obtains the existence of () pen (as a 
set). To see this, choose an arbitrary by € a, and observe that 
hea b = te € bo | Vx(x Ea > cE x)}. 


Remark 6.2.1. Comprehension implies that there is no set containing 
all sets. Indeed, suppose for contradiction that U& F Vzz € a for some 
a € U. By Comprehension there is a set b such that UF Vz(z Ee bo 
z € z), so in particular b € b if and only if b ¢ b. This contradiction is 
Russell’s Antinomy, and it shows that a cannot exist. 


Pairing Axiom. Vy), y,.4xVz(z Ex 6 (Z=y, VZ= yp). 


It expresses that if a and b are two sets, then {a, b} is a set. 


Definition. The ordered pair (also called Kuratowski pair) of two sets a 
and b is the set (a, b) := {{a}, {a, b}}. 


Lemma 6.2.2. With the axioms stated so far, we have: 


(1) The collection of ordered pairs forms a class. Moreover, there is 
an £--formula 9(x, y) such that U EF gla, b] if and only if a is 
an ordered pair of the form a = (b,c). The analogous fact holds 
for the second coordinate. 


(2) One has (b,c) = (b’,c’) if and only if b = b' andc =c’. 


Proof. Exercise. 
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Union Axiom. Vy4xVz(z Ex — Jw(z EwAw Ee y)). 


It expresses that for every set a, the following class is given by a set: 
Ua :={z| dw(z Ee wAw Ea)}. 

Note that, by combining the Pairing Axiom and the Union Axiom, 
one gets that the union a U b of two sets a and b is given by a set. 


Power Set Axiom. VyaxVz(zExozCy). 


It postulates the existence of the set of subsets of a set, that is, for 
any set a the class P(a) = {b | b C a} is given by a set. 


Lemma 6.2.3. The axioms stated so far imply the existence of the carte- 
sian product of two sets a and b: ax b = {(x,y) |x E€ any e€ bhisa 
Set. 


Proof. If x € aand y € b, then one has {x}, {x, y} € P(a U b), whence 
(x,y) € P(P(a U b)). One may conclude by Comprehension, using 


Lemma 6.2.2. 


One may also define triples (x,y,z) := ((x, y),z) and more gener- 
ally n-tuples, via (X1,...,Xp41) != ((%1,.-.,X,),Xn41), inductively. One 
obtains a x b X c, and more generally a, X --- X dy. 


Definition. 


* A (binary) relation R is a set of ordered pairs. One sets 
dom(R) := {x | dy(x,y) € R}, called the domain of R, and 
im(R) := {y | dx(x, y) € R}, called the image of R. 


* A function f is a relation which is unique on the right, that is, 
which satisfies Vx, y,z((x,y) € fA(x,z) € ff > y =2Z). 
Remark. 


¢ Note that if R is a relation, then dom(R) and im(R) are sets. 
Indeed, for (x, y) € R one has x,y € [J (U R). 


* By definition, a function is identified with its graph. 
Notation. When f is a function, we usually write f(x) = y instead of 


(x, y) € f. Sometimes, for x ¢ dom(f), we set f(x) := 0. Ifa = dom(f) 
and im(f) € b, we write f : a > b. 
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Lemma 6.2.4. Let aand b be two sets. With the axioms we have stated so 
far, one then gets the following: 
(1) {R | Risa relation with dom(R) € a and im(R) C b} isa set. 
(2) {f | f : a > bhisa set. 


(3) The collection of functions forms a class. We denote by Fn(x) an 
£-formula which defines this class. 


Proof. Exercise. 


A family of sets, indexed by the set I, isa function f with dom(f) = I. 
One usually writes (a;);<7 for such a family, where a; = f(i). 


Remark. If (a;)j<7 is a family of sets with non-empty index set J, then 
the class Tex qa, :={g:I- (Par a; | Vz(z € I > g(z) € z)} is given 
by a set. 


Definition. We say that F C U? is a functional class if there is an Le y- 
formula g(x, y) which defines F such that 
UE WX, V1, Y2 (POV) A PO Ya) > Vi = Ya)- 


The class Dom(F) defined by Ay¢ is called the Domain of F, and the class 
Im(F) defined by 4x¢ is called the Image of F. 


Note that a functional class gives rise to a function in the naive sense, 
with domain Dom(F). One may also consider functional classes which 
correspond to naive functions between a class D C U” and U that are 
definable in Ley. 


Replacement Axiom Scheme. For each formula (x, y, v;,...,U,) in 
the language £¢, one puts an axiom of the form 


WU, Vy, «+s UnL VX, V1, Y2(P(X% 1,0) A P(X, 2,0) > Vy = V2) 
> dni VV € Ung1  Ax(x € Up A (x, y, v)))]. 


It expresses that if F C U? is a functional class and a is a set, then 
Fla] := {z | duu € aA(u,z) € F)} isa set. (It is obtained by ‘replacing’ 
every element of a by its image under F.) 


The axioms stated so far are not independent. Indeed: 
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Lemma 6.2.5. 


(1) The Replacement Axiom Scheme implies the Comprehension Ax- 
iom Scheme. 


(2) The Pairing Axiom is a consequence of the other axioms stated so 


far. 


Proof. Let g(x) be given. Let F(x, y) be the formula (x = y A 9(x)). 
Then {x € a | 9(x)} = F[a], proving (1). 

To prove (2), suppose that a and b are two sets. Observe that § € 
P(B) = {GO}, so in particular 6 4 P(G). Let F(x, y) be the formula ((x = 
OAy = a)V (x = PY) Ay = b)). This is a functional class, and for 
c = P(PG@)) = {B, PD}, one gets F[c] = {a, b}. 


Remark. We may work in an expansion by definition and use for ex- 
ample relation symbols x C y (binary) and f : x > y (ternary), func- 
tion symbols x Uy, xn y, x \ y, Uy, Py), {x,y}. (x,y), x X y, dom(R), 
im(R), the constant symbol 9, etc., in the Comprehension and Replace- 
ment schemes. By Proposition this does not change anything. 


Recall that a set x is an ordinal if it is a transitive set such that €},..x 
defines a well-order on x, that is, a total order which is well-founded. 
(Well-foundedness may be expressed by the following formula: Vy(y € 
P(X)A7Y =H > AzZ(’zZE YAZNy =9)).) Let Ord(x) be an £--formula 
which defines the class of ordinals (in all models of the axioms stated so 
far). We then get a formula 


Card(x) := Ord(x) A Vy(y € x — ‘there is no surjective’ f : y > x) 
which defines the class of cardinals. 


Notation. In what follows, we will use a, 8, y, etc., to denote only ordi- 
nals. For instance, we will write Vygy instead of Vy(Ord(y) > ¢). 


Axiom of Infinity (AI). 4x@ € x AVz(z Ex > zU{z} © x)). 
It postulates the existence of a set which contains @ and is stable 


under the ‘successor’ function. 
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Let a be aset as postulated by (AI). Applying Comprehension if nec- 
essary, we may assume that a is a set of ordinals. Let a be the least or- 
dinal such that a ¢ a. By construction, @ € a and a is stable under the 
successor function, so a@ is a limit ordinal. 


We let Lim(x) be a formula which defines the class of all limit ordi- 
nals. By what we have just seen, limit ordinals exist. One denotes by w 
the smallest limit ordinal. 


Axiom of Foundation (AF). Vx(-x = @ > 4z(zExAZNx =9Q)). 


Observe that (AF), together with the Pairing Axiom, implies that no 
set a contains itself as an element, since otherwise {a} would contradict 
(AF). 


Notation. We denote by ZF the axioms of ZF without (AF), and by 
ZFC the axioms of ZFC without (AF). 


Remark 6.2.6. 


(1) In any model of ZF, the ordinals satisfy the same properties 
as we have seen in Chapter Il Similarly for the cardinals in 
models of ZFC. 


(2) The class of ordinals is not a set. Indeed, if Ord were given by 
the set a, then a would be transitive and well-ordered by €, 
so an ordinal and thus a € a. Buta ¢ a for all ordinals by 


Proposition 
Similarly, Card is not a set. Indeed, if it were given by the 


set a, then Ord would be given by the set |) a. 


Lemma 6.2.7 (Transfinite induction). Let UF ZF , and let p(x) be an 
Lev-formula. Then U satisfies the following induction property: 


(0) A Vy(¢(yY) = vy + 1) 
AVy[(Lim(y) A V6(6 < y > 9(4))) > oY) > Vre(y). 
Proof. If U( satisfies 
90) A Vy(e(_Y) = Ply +1) A Vy[(Lim(y) A V6(6 < y > 9(9))) > oY] 


and Uf F 7g[ca] for some ordinal a, it is enough to choose a minimal such 
a to get a contradiction. 
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Theorem 6.2.8 (Definition by transfinite recursion). Let UF ZF , and 
let G be a functional class with Dom(G) = U"*1. Then there is a unique 
functional class F with Dom(F) = U" x Ord such that one has F(w, a) = 
G(W,F ta}xq) for every tuple of sets w and every ordinal a. 


Proof. The key idea is to approximate F by functions. We prove first 
that for every w and every ordinal f there is a unique function fG g with 
domain 6 such that fo (a) = G(w,{w} x fog ta) foralla < B. The 
uniqueness is clear by transfinite induction. 

Fix w. To establish existence, let us prove by induction on f that 
there is a function fg with domain f such that 


(*)o,8 Va(a<B > fgg(a) = GW, {w} x fog ta))- 


If 8 = 0, set fog = 9. 

Iff = 6’ +1and fGg is a function which satisfies («)G g, one may 
simply set fog = fag U{(6’, GU, (W} X fag))}- 

Finally, if 6 is a limit ordinal, we consider the set AG, B of all functions 
f such that there exists 8’ < 8 with dom(f) = f’ and f satisfies («)j5, gr. 
By Replacement and uniqueness, XZ g is indeed a set. Then, again by 
uniqueness, U fleXg f’ is a function, and it clearly satisfies (ag: 


We may now set (w, 8, y) € F : for any function f with domain 
6 + 1 satisfying (*)z5,g41 one has f(8) = y. 


Example 6.2.9 (Applications of transfinite recursion). 


(1) The operations of ordinal arithmetic (ordinal addition, mul- 
tiplication and exponentiation) may be defined by transfinite 
recursion (see Remark|1.6.5)). 


(2) The N-hierarchy of infinite cardinals is given by a functional 
class % : Ord —> Card. Indeed, it is sufficient to apply Theo- 
rem6.2.8|to the functional class G : U — U defined as follows: 

* G(0) = a. 

- If f : a > 6 for two ordinals a, 8, then G(f) equals the 
least cardinal x such that x > f(a’) for every a’ < a. 

« Finally, if x is not a function between two ordinals, then 


G(x) = @. 
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(3) The von Neumann hierarchy. By transfinite recursion, one de- 
fines a functional class a VY as follows: 
~hH=9. 
» Vers = PY). 
-Y=U,., % for/ a limit ordinal. 


Proposition 6.2.10 (ZF ). The ordinal w endowed with the ordinal op- 
erations (addition, multiplication and the successor function), with 0 = 6 
and < given by €, is a model of PA. 


Proof. Exercise. 


6.3. The Axiom of Choice 
Axiom of Choice (AC). 


Vf[Fn(f) A 9 ¢ im(f)) > 3g (Fn(g) A dom(g) = dom(f) 
AVx(x € dom(g) > g(x) € f(x)))]- 


It expresses that the product of a family of non-empty sets is non- 
empty. 
Definition. Let a be a set. A choice function on a is a function h : 


P(a) := P(a) \ {G6} = asuch that h(A) € A for all A € P(a)’. 


Proposition 6.3.1. (AC) is equivalent to the existence of a choice function 
on every Set a. 


Proof. Suppose (AC). For any set a, we then have [] ,- vay 4F g. Any 
element of this product is a choice function on a. 


Conversely, let (a;);<; be a family of sets with a; # @ for alli € I. 
Leta = hee a; andh : P(a)’ > a bea choice function on a. Then 
ho f € JJ,-; 4, where f : I > P(a)', fl) := qi. 

Theorem 6.3.2. The following are equivalent in the theory ZF : 
(1) (AC). 


(2) Zorn’s Lemma. 


(3) Zermelo’s Theorem (Wohlordnungssatz). 
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Proof. (1)=>(2): By way of contradiction, let us suppose that (X, <) is an 
inductive partial order without maximal element. We consider the set 


J ={T € P(X) | T is totally ordered by <}. 


As X is inductive and there is no maximal element in X, for any T € J, 
the set B(T) := {x €X |x >t Vt € T}is non-empty. By (AC), there is 
a function b : J > X such that b(T) € B(T) for any T € 7. 


Let G be the following functional class: 


« if g is a function with dom(g) € Ord and im(g) € J, then 
G(g) = b(im(g)); 
* G(g) = G, otherwise. 


Applying Theorem (6.2.8, we obtain a functional class F with 
Dom(F) = Ord such that F(a) = G(F | ) for every ordinal a. One 
shows easily by induction that F(a) € X for every a and that 


a< B= F(a) < F(f). 


In particular, F is injective as a naive function. The class F-!, defined 
by (x,a) € F-! :© (a,x) € F, is thus a functional class. We have 
F~1[X] = Ord, so Ord is a set by the Replacement Axiom Scheme. This 
contradicts Remark 6.2.6(2). 

(2)=(3): Let a be a set. We have to show that a admits a well-order. 
Consider X := {(b,R) | b € P(a) and R is a well-order on b}. It is easy 
to see that X is a set (exercise). 


On X, we define a partial order as follows: 
(b,R) < (b’,R’) :©@ bisan initial segment of (b’,R’) and R’ |, =R. 


This partial order is inductive. Indeed, if (b;, R;);ez is a totally ordered 
subset of X, then (b, R) := (Uje; bis U;-7 Ri) is an upper bound of this 
subset. By Zorn’s Lemma, there is an element (b, R) € X which is max- 
imal for <. If 6 4 a, there exists y € a \ b. We set b’ = bu {y} and 
R' := Ru{(x,y) | x © b}. It is then clear that R’ is a well-order on b’ 
which prolongates the one on b, contradicting the maximality of (5, R). 


(3)=>(1): Let a be a set. By hypothesis, a may be well-ordered, say 
by <. The function f : P(a)’ > a which associates to any non-empty 
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subset of a its smallest element is a choice function on a. We conclude 


by Proposition 


Remark 6.3.3 (Skolem’s Paradox). If ZFC is consistent, it has a (neces- 
sarily infinite) model. Since £<- is countable, ZFC thus admits a count- 
able model N by the Downward Lowenheim-Skolem Theorem. But there 
exist uncountable sets in N, for example X, or R. 


Explanation: The notion of ‘countability’ depends on the model of ZFC 
one works in, so it is relative. Being countable in the sense of M and 
being countable in the sense of the (naive) ground model of set theory 
in the metatheory is not the same thing. Moreover, the base set M of 
is a set from the naive point of view, and a proper class from the point of 
view of WM. 


More generally, the notion of cardinality depends on the model. For 
instance, the set of natural numbers N and the set of real numbers R 
(both taken in the sense of §%) are both countable from the point of view 
of the underlying naive ground model. But the bijection between these 
two sets which exists as a naive set is not even represented by a functional 
class in MN. Indeed, otherwise, by Replacement, it would be given by a 
function in the sense of M. 


6.4. The von Neumann Hierarchy and the Axiom of 
Foundation 


In this section, we will workin UE ZF . 


Definition. Let a be a set. By recursion on n € w, one defines ag := 
A, Any, 1= A, U Lay. The set tcl(a) := LU. a, is then called the 
transitive closure of a. 


new 


Lemma 6.4.1(ZF ). The set tcl(a) is the smallest transitive set containing 
aas a subset. More precisely, the following properties hold: 


(1) aC tel(a). 
(2) tcl(a) is a transitive set. 


(3) Ifa C t with t a transitive set, then tcl(a) C t. In particular, 
aC b> tel(a) C tel(b). 


(4) If ais transitive, then tcl(a) = a. 
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(5) Ifb €a, then tcl(b) C tcl(a). 
(6) tel(a) = aU U,., tel(b). 


Proof. (1) and (2) are clear. To prove (3), one shows by induction that 
a, Ct for alln € a. (4) follows from (1-3). 

(5) One has b € a= b € tel(a) > b C tel(a) => tcl(b) C tecl(a), 
where the first implication follows from (1), the second from (2), and 
the last one from (3). 

(6) One has tcl(a) D aU U, a tcl(b) by (1) and (5). To prove the 
other inclusion, it suffices by (3) to prove that aU U pee tcl(b) is transitive, 
which is clear. 


Recall the definition of the von Neumann hierarchy: Y% :=@, V4.4 := 
PV) VY t= Uge, Ye (for 2 a limit ordinal). 

We define the class V by the formula da x € Vy, which we will de- 
note by V(x). Informally, one thus has ‘V = Uy corg Ve: 


Definition. Ifa isa set, the rank of a is defined as 


Os {* least y such that a € V,,1, ifsuch ay exists; 


oo, otherwise. 
Lemma 6.4.2. 


(1) VY is a transitive set for any ordinal a. 
(2)Bsa>ycn. 

(3) W={x € V | rk(x) < ax}. 

(4) Ifx €V andy €x, theny € V and rk(y) < rk(x). 

(5) Ifx € V, then rk(x) = sup{rk(y) +1 | y € x}. 

(6) Ifx € V is transitive, then {rk(y) | y € x} is an ordinal a. 


(7) One has rk(a) = a for every a. In particular, a € V and {B € 
Vz | B € Ord} = a. 


(8) Let x bea set. Then x € V if and only ifx CV. 
(9) Assume (AC). If x € V is transitive, then x © Veara(x)+- 
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Proof. (1) & (2) By transfinite induction on a, one shows that Vy is tran- 

sitive and that Vg C Vg forall 6 < a. The cases a = 0 and ea limit ordinal 

are Clear. Now suppose a = y +1. Since J, isa transitive set by the induc- 

tion hypothesis, P(V,,) = Vz is transitive, too. If 8 < a, then 8 < yand so 

inductively V3 C V,, whence V3 € V, and finally V3 C V, by transitivity. 
(3) Ifx € V, then rk(x) <a@ (GB <a)xE Yi ox EN. 


(4) Let a = rk(x). Then x € V4, = P(\). For y € x, one obtains 
y € \ and hence y € V and rk(y) < a@ by (3). 


(5) For x € V, set a = sup{rk(y) +1 | y © x}. By (), one has 
a <rk(x). Since rk(y) < a@ for any y € x, it follows from (3) that x C V,, 
whence x € V,,, and finally a > rk(x) by definition. 


(6) Suppose x € V is transitive and 6 < rk(x) is given. By (5) there 
exists y € x such that 6 < rk(y). Choose such an element y with min- 
imal rank. If z € y, then z € x (by transitivity of x) and rk(z) < rk(y) 
by (4), so rk(z) < f by minimality of rk(y). This proves that y C Vg, and 
thus y € Vg,,, and hence rk(y) < f. It follows that rk(y) = 6. 

(7) We prove by transfinite induction that a € V and rk(a) = a, the 
case a = 0 being clear. Suppose that the result holds for all 6 < a. Then 
B € Ve4i © VY forall B < a, soa € Y and thus a € Vy4;. We get 
rk(a) = sup{68 +1| 6 < a}=aby(5). 

(8)x € V=>x C V follows from (4). Conversely, if x is a set such 
that x C V, then {rk(y)+1 | y € x}is given by a set by Replacement. For 
a = sup{rk(y) + 1 | y € x} we thus infer that x C Vz, and sox € W441. 

(9) By (6), the set {rk(y) | y € x} is equal to an ordinal a. Since 
a = sup{6 +1 | 6 < a}, by (5) we get rk(x) = a. Thus x © Yaracx)+ by 
(3), as card(a) < card(x) and hence a < card(x)*. 


Theorem 6.4.3. Let UF ZF . The following are equivalent: 


(1) WE (AF). 
(2) WE VxV(x). 


Proof. (2)>(1): Let x be a non-empty set. We have to find an element 
y of x such that yn x = 9. By hypothesis, every set is in V. We choose 
y € x such that rk(y) is minimal. Then y n x = 9 by Lemma 6.4.24). 
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(1)>(2): Let x be a set. Put y := tcl(x), and consider the set z := 
{t ey |t ¢ V}. Wehave x C y. By Lemma (6.4.(8), in order to prove 
x € V, it suffices to prove that every element of y is in V, in other words 
that z = @. If z were not empty, by (AF) there would exist t © z with 
tz =. Let us consider such a t. Then, for u € t, we would get u € y 
(by transitivity of y) and thus u € V (since tn z = @). But then t € V by 
Lemma .4.2(8), which is a contradiction. 


Remark. As the elements of V are constructed starting from the empty 
set (by a transfinite procedure), Theorem expresses that (AF) is 
equivalent to the fact that every set is constructed from the empty set. 


Remark 6.4.4. In the theory ZFC, the following are equivalent: 
(1) (AF). 


(2) There is no sequence (q;)j<, With a;,, € a; for allie w. 


Proof. (1)>(2): (This implication does not use (AC).) If (a;)je,, is a se- 
quence of sets, consider a = {a; | i € w}. By (AF), there exists b € a 
such that bn a = @. Hence for some n € w one has a, Na = G, soin 
particular ay,41 €¢ Ay. 

(2)>(1): Let a  @ be a set which contradicts (AF). This means that 
for every b € a one has bna # Q. By (AC) there exists a function 
f : a-— asuch that f(b) € b for every b € a. Choose ag € a, and 
define recursively a sequence (a,;)jc, putting ay.; = f(ay). 


Lemma 6.4.5. 
(1) Ifx € V, then |J x, P(x) and {x} are in V. The rank of these sets 
is strictly smaller than rk(x) + . 
(2) Ifx,y € V, thenx x y,xUy,xN y, {x, y}, (x, y) and x” are in 
V, too. Moreover, the rank of these sets is strictly smaller than 
max{rk(x), rk(y)} + @. 


Proof. Exercise. 


Remark. As the Extensionality Axiom (which expresses in some sense 
that there are only sets), the Axiom of Foundation restricts the universe 
of sets to the place where the usual mathematics take place: 
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¢ The usual objects of mathematics (for instance N and R) are in 
V (cf. Exercise 6.7.1). 


* Ifwe workin ZFC, then any (first-order) structure has an iso- 
morphic copy in V. (We suppose that the signature o“° of the 
language we consider is finite, to simplify the presentation.) 
Indeed, suppose that M@ = (M;R;j,c;, fc) is an Lo-structure, 
and let g : M = card(M) = « bea bijection. There exists a 
unique £y-structure 2% with base set x such that g is an Ly- 
isomorphism between IN and . Then x € Y,,, andso MN € 
Veo by LemmaBa3 


¢ This remains true for structures outside the first-order context. 
Consider for example the case of a topological space (X, 7), 
with X a set and J C P(X) the set of open subsets of X. Then 
if g : X = xis a bijection, one may argue as before to prove 
that (Xx, 7) admits an isomorphic copy in \,.). 


6.5. Some Results on Incompleteness, Independence 
and Relative Consistency 


In this section, we will suppose that the £<-formulas and the proofs are 
coded in ZF . To this aim, we may for example appeal to the coding 
we have given in arithmetic and use that if F ZF and w € U, then 
(w;0, +, -,succ, Ef ,) F PA (see Proposition 6.2.10). We continue to write 
#¢ for the code of g. 


Gédel’s incompleteness results have their analogues in set theory. 
We will state them and leave the proofs to the reader - the arguments 
are analogous to those in the case of Peano arithmetic. However, coding 
formulas and formal proofs is slightly simpler in set theory. Let us note 
that ZF ,ZF,ZFC and ZFC are recursive theories. 


Theorem 6.5.1 (Gédel’s First Incompleteness Theorem). Let T be a re- 
cursive and consistent £<-theory such that T D ZF . Then T is incom- 
plete. 


Remark. If ZF is consistent, then (AF) is independent of ZF, that is, 
ZF ¥ (AF) and ZF_ ¥ -(AF). (This is shown in Exercise 6.7.7) Sim- 
ilarly, (AC) is then independent of ZF and (CH) is independent of ZFC. 
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All of (AF), (AC) and (CH) are mathematically meaningful statements. 
In contrast to this, in the case of arithmetic, assuming that PA is con- 
sistent, the sentences which we proved to be independent of PA were 
rather far fetched and involved the diagonal method. 


Theorem 6.5.2 (Gédel’s Second Incompleteness Theorem). Let T be a 
recursive and consistent £ --theory such that T D ZF . ThenT K Con(T). 


We work in a model U of ZF . One may then code the satisfaction 
in U as follows. Let £ bea (finite) language and %& = (A;(Z™)z<5c) an L- 
structure with 2% € U. One identifies the set of assignments (with values 
in 2f) with A®, which is an element of U. The following lemma may be 
proved by induction on the height of a formula, which corresponds to 
an induction on w. We leave the details as an exercise. 


Lemma 6.5.3. There exists a functional class which to any triple (A, #¢, a) 
in U, with XL an L-structure, p an £-formula and a an assignment with 
values in &, associates 1 if 2% FE pla], and 0 otherwise. 


In particular, if X is an £-structure in U, then the collection # Th(2) = 
{#@ | g is an £-sentence such that & F ¢} is given by a set in U. 


A relative consistency result has the following form: given two the- 
ories T, and T>, the consistency of T, is proved to imply the consistency 
of T,. The proof method will be to construct a model of T, from a model 
of T. 

Let us start with a result which establishes a relation between set 
theory and arithmetic. One verifies without difficulty that the proof of 
Proposition 6.2.10 may be done in ZF. As the structure (w; +, x, S, 0, <) 
isaset in Uf, Lemma entails the following result. 


Proposition 6.5.4. One has ZF | Con(PA). In particular, if ZF is 
consistent, then Peano arithmetic PA is consistent, too. 


Let UF ZF . If X is a non-empty class in U, one may consider the 
(naive) £.-structure (X;€} x). In the proofs of the relative consistency 
results we will present, we will construct classes X such that if UF T;, 
then (X; fx) F T>, whence the desired relative consistency result. 


Notation. We will sometimes write X F y instead of (X; Ex) F 9. 
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Definition (Relativization). Let £ be a language, and let F(vg) be an 
£-formula. For every £-formula ¢ one defines, by induction on ht(g), 
an £-formula ¢", the relativization of ¢ to F: 
- of :=gif¢g isan atomic formula. 
» (PAP) := (AP) and [-g]" += >[p"]. 
© [Axg]P := Ax(F(x) A oF). 
Proposition 6.5.5. Let UF ZF , and let X C U bea class. 
(1) Suppose that X = F[U] = G[U]. For every £.-formula ¢, the 
formulas g¥ and ¢® are then equivalent in U. One may thus 


write o* instead of ¢", if one is only interested in the formula up 
to equivalence. 


(2) For every ay,...,a, € X and formula 9(x,..., Xp), one has U F 
¢* [a] if and only if X F g[al. 
Proof. (1) The proof is an easy induction on ht(¢) and left as an exercise. 


(2) The proofis by induction on ht(¢), the only non-trivial case being 
the case when ¢ is of the form 4x ~. In this case, one has 


UE o* [a1 .05@,] <> UE Axg(F(Xo) AW Nays -s Ay] 
<= there exists b € U such that U F F[b] and UF *[b, a] 
<= there exists b € X such that U F #*[b, a] 
<=> there exists b € X such that X F ~[b,a] <> X F ¢fal. 


Definition. Let X be a class defined by an £.-formula F(up). One 
says that an £.-formula ¢(%},...,x,) is absolute for X if one has U F 
Vx10 Xn (Aj, FO) > @ & 9%). 

Definition. The set of Ag-formulas is the smallest set of £.-formulas 
which contains the atomic formulas and which is stable under boolean 
combinations and bounded quantification (if ¢ is a Aj-formula and x, y 
are two distinct variables, then 4x(x € y Ag) and Vx(x € y > ) are 
Ao-formulas, too). 


Lemma 6.5.6. 


(1) Assume that X is a non-empty class which is transitive (that is, 
zE€yEeX>z EX). Thenall Ag-formulas are absolute for X. 
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Moreover, the set of formulas which are absolute for X is stable 
under boolean combinations and bounded quantification. 


(2) The following properties may be expressed by Ay-formulas: x C 


y,x = G6,x = yU {y}, & is transitive? z = {x,y} y = Ux, 
Z=XXY. 


Proof. (1) Atomic formulas are absolute for every non-empty class. 
Moreover, for any class X, the set of formulas which are absolute for 
X is stable under boolean combinations. We now prove that if X is a 
transitive (non-empty) class and if g(x, y, Uy,...,U,,) is absolute for X, 
then the formula 4x(x € y A ¢) is absolute for X, too. For elements 
b, C,...,Cy, © X, one has the following equivalences: 


UE (Ax(x Ey AG))* [b,7] 

<= UF (AX(F(x) Ax E yA G*)) [B,C] 

<> there exists a € bM.X such that UF g*[a, b,c] 

<=> there exists a € bm X such that UE ¢[a, b,c] 

<> there exists a € b such that U F g[a, b,c] 

= UF (Ax(x E yAg)) [b,c]. 
The first equivalence holds by definition, the third one follows from 
the induction hypothesis and the fourth one from the transitiviy of X. 
Stability under bounded universal quantification follows as well, since 
Vx(x € y > ¢) is equivalent to =Ax(x € y A 779). 


The proof of (2) is straightforward. For example, the transitivity of 
x is expressed by the Ap-formula: Vy(y € x — Vz(z Ey > ZEx)). 


Let X beaclass. We say that a functional class G(x, y) with Dom(G) > 
X is an absolute functional class for X (in U) if G(x, y) is absolute for X 
and if for any a € X, the unique b € U such that U F G(a,b) is in X. 
The following remark follows easily from the definitions: 


Remark 6.5.7. A composition of absolute functional classes is an abso- 
lute functional class (for X). 


Lemma 6.5.8. Weworkin UE ZF . 
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(1) Any non-empty transitive class satisfies the Extensionality Ax- 
iom. 


(2) LetX bea class defined in U, and let G be an absolute functional 
class for X. Then X F VxA! yG(x, y). 
Moreover, if 9 is absolute for X, then 4x(x € G(z) A g) and 
Vx(x € G(z) > ¢) are absolute for X, where x, z are distinct 
variables. 


(3) The Union Axiom holds in V and in any V,, with a > 0. 


(4) The Pairing Axiom holds in V and in any V, with A a limit ordi- 
nal. 


(5) Suppose that UF (Vx € X)\(Ay € X)P(x) NX = y for some 
non-empty transitive class X. Then the Power Set Axiom holds 
in X. In particular, the Power Set Axiom holds in V and in V, for 
any limit ordinal A. 

Moreover, y = P(x) is a functional class which is absolute 
for V and V, with A a limit ordinal. 


Proof. (1) The Extensionality Axiom relativized to X is given by the sen- 
tence (Vy EX)(VZEX)(VxX EX\(x Eyoxe€z) > y =2Z). Since X is 
transitive, if y,z € X, then y,z C X. The result follows. 


(2) The first part is clear. Now assume that G is an absolute func- 
tional class for X and that ¢y(v, x, z) is absolute for X. Given a,b from 
X, let c = G(b) (computed in U or in X, which amounts to the same by 
the first part of (2)). Then & F Ax(x € G(z) A ¢)[a, b] if and only if 
uF Ax(x € yA ¢g)[a,c], and similarly for X in place of U. The absolute- 
ness of 4x(x € G(z) A ¢) then follows from Lemma 6.5.6(1). 

(3) The formula y = L) x is Ay by Lemma .5.4, so absolute for tran- 
sitive classes. As rk((J x) < rk(x), one hasx EY > Ux e & foralla 
and in particular x € V > )x € V. One concludes by (2). 

(4) One has x,y € % => {x,y} © Ya). It follows in particular 
thatx,y € Y => {x,y} © Y for 2 a limit ordinal. One deduces that 
(x, y)  {x, y} is given by a functional class which is absolute for V and 
Y (for A a limit ordinal), and one may conclude by (2). 

(5) One has U F Vx € XAy © X(P(xX) NX = y) if and only if 
UFVx ExXAyEeXVzE X(zCxoz Ey). The latter sentence is just 
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the Power Set Axiom relativized to X, since z C x is absolute for X by 
the transitivity of X. The proof of the remaining part of (5) is left as an 
exercise. 


Lemma 6.5.9. The formulas Ord(x) and Card(x) are absolute for V as 
well as for Vy if A is a limit ordinal. 


Proof. We first consider Ord(x). The transitivity of x and the fact that 
€} , defines a total order are expressed by Ag-formulas and are thus ab- 
solute in both cases. The items (2) and (5) of Lemma 6.5.8) entail that the 
following formula is absolute: 


Vz(zEP(XAAZ#EB > Ju(UE ZAVWWEZ> WwW EN))). 
This establishes the absoluteness of well-foundedness. 
The formula Ord(x) A Vy(y € x > 7Sf © P(xxy)f ix & y) 
is equivalent to Card(x). One proves easily that the formula f : x = y 
(in the three variables f, x, y) is absolute for V and for ¥ if A is a limit 


ordinal, and that the functional class z = x x y is an absolute functional 
class in both cases (exercise). This proves the absoluteness of Card(x) in 


both cases, using Lemma 6.5.8(2) and Remark (6.5.7), 


Definition. We work in ZF . Let x be a cardinal. 


¢ We say x is a strong limit cardinal if for all u < x one has 2" < x. 


* We say x is inaccessible (strongly) if it is strong limit, regular 


and > No. 
Example. By transfinite recursion, one defines a cardinal hierarchy as 
follows: 2p := No, Qgei := 274, and Dy := SUP, <q 2a for A a limit 
ordinal. 


For any limit ordinal A, 2, is then a strong limit cardinal. But as 
cof(2,) = cof(A) in this case, 2, is singular in general. 
Lemma 6.5.10. Let U F ZF , and letX = VorxX = V,, orX = \% 
for x an inaccessible cardinal. In the last case, we assume in addition that 
uF (AC). Then X satisfies the Replacement Axiom Scheme and the Axiom 
of Foundation. 


Proof. Choose a formula F(x) which defines X. Let G(Up, v,) be a for- 
mula (with parameters from X) which defines a functional class in the 
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structure (X,€} y). Then the formula H(up, v,;) = F(Ug) A F(v,) A GF 
defines a functional class in U. Set b := H[a] = {H(c) | c € a}, where a 
isasetin xX. 


- IfX = V, then b € V by Lemma 6.4.2(8), since b C V and b is 
a set. 


« IfX = V,,, then, as ais finite, b is a finite subset of V,,, so b C V, 
for some n € w and hence b € ¥,. 


« IfX = ¥, for an inaccessible cardinal x, essentially the same ar- 
gument works. One proves by transfinite induction that 
card(V,) < « for alla < x. (Since we assume U F (AC), we 
may use cardinalities.) For a a successor ordinal, one uses that 
x is strongly limit; for a a limit ordinal, one uses the regularity 
of x. 

Now, ifa € ¥, thena € Vy for some a < x and thus 
aC 41. It follows that card(b) < card(a) < «. Moreover, one 
has b C \. Since x is regular, sup{rk(c) + 1 | c € b} < x and 
hence be \. 


This proves the Replacement scheme in all three cases. 


In order to prove that (AF) holds, we consider § 4 a € X. It follows 
that a C X in all three cases, as X is transitive. For any b € a of minimal 
rank, one has an b = @, and any such b is in X. One concludes, since 
the formula an b = @ is absolute for X. 


Lemma 6.5.11. One has V F (AI), Y FE (AD) for any limit ordinal A > a, 
and V,, F =(AI). 


Proof. One has w € V,w € Yj and w ¢ V,. The details are left to the 
reader. 


Lemma 6.5.12. If UF ZFC , then V F (AC) and Y F (AC), for x an 
inaccessible cardinal. 


Proof. Let (X;);-; be an element of 4%. Then I € V, and every element 
ge Il <1 Xi is in V,. For V, one concludes by the same argument. 


Theorem 6.5.13 (Relative consistency of (AF)). 
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(1) IfUF ZF , then V F ZF. In particular, the consistency of ZF— 
entails the consistency of ZF. 


(2) If F ZFC , then V F ZFC. In particular, the consistency of 
ZFC entails the consistency of ZFC. 


Proof. It suffices to combine the preceding lemmas, taking into account 
the fact that the Replacement scheme implies the Comprehension 


scheme (cf. Lemma 6.2.5). 


Theorem 6.5.14. [fF ZF, then V,, EF ZFC—(AI)+-(AD). In particular, 
ZF F Con(ZFC — (AI) + -(AD). 


Proof. We have proved everything in the lemmas but V,, F (AC). Ob- 
serve that every element a of V, is a finite set and in bijection with an 
element of w. Such a bijection f as well as the well-order on a induced 
by f isin VY. 


Let (IC) be the statement ‘There exists an inaccessible cardinal’. 


Theorem 6.5.15. Let UF ZFC andx € U bean inaccessible cardinal. 
Then V, F ZFC. In particular, ZFC + (IC) F Con(ZFC). 


Proof. All axioms of ZFC” hold in Y, by the preceding lemmas. 


By the Second Incompleteness Theorem, it follows from Theorem 
6.5.15|that ZFC K (IC). The following theorem states the corresponding 
relative consistency result. We will give a direct proof which does not 
rely on the Second Incompleteness Theorem. 


Theorem 6.5.16. If ZFC is consistent, then ZFC + =(IC) is consistent, too. 


Proof. Let U F ZFC. We may suppose that 7 F (IC). Let « be the 
smallest inaccessible cardinal in U. Then %. F ZFC by Theorem 
To conclude, it suffices to prove that ¥% F -=(IC), a fact which follows 
from the following observations: 


* y = P(x), Ord(x) and Card(x) are absolute formulas for %. 
(cf. Lemma and Lemma §.5.9). 
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‘A is a regular cardinal’ is absolute for \. [Indeed, 
it is easy to see that this property may be expressed by the for- 
mula 7(Aa < A)\(Af : a > A cofinal).] 

‘A is a strong limit cardinal’ is absolute for V,.. [Indeed, this 


property may be expressed by the following formula (in A): 
(Wa < A)7(Sf : P(x) > A surjective). ] 


Remark 6.5.17. If ZFC is consistent, then 
ZFC F Con(ZFC) > Con(ZFC + (IC)). 


Proof. Assume ZFC F Con(ZFC) — Con(ZFC+(IC)). Then we have in 
particular ZFC +(IC) F Con(ZFC) > Con(ZFC™ +(IC)), so ZFC+(IC) F 
Con(ZFC+(IC)) by Theorem.5.15,, By the Second Incompleteness The- 
orem, this means that ZFC + (IC) is inconsistent, which implies the in- 
consistency of ZFC by assumption. 


6.6. A Glimpse of Further Independence and Relative 
Consistency Results 


At the end of this chapter, we will mention some important further re- 
sults on independence and relative consistency in axiomatic set theory. 
We will not give any proofs, but we will sketch some key ideas of the 
methods behind these results. This part is meant as a motivation for 
further reading. The books by Krivine [6] and Kunen [[7] are excellent 
references. 


The results in the following theorem may be obtained, in a 
rather elementary way, by the method of Fraenkel-Mostowski. They 
will be treated entirely in the exercise section (cf. Exercise and 


Exercise 6.7.8). 
Theorem 6.6.1. 
(1) If ZF is consistent, then so is ZFC” + 7(AF). 
(2) If ZF is consistent, then so is ZF + 7(AC). 
It is possible to replace ZF + =(AC) by ZF + =(AC) in the second 


part of the preceding theorem, but then the proof gets more involved (see 
below). 
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Recall that the Generalized Continuum Hypothesis (GCH) is given 
by the sentence Var(2*« = Nq41). 


Theorem 6.6.2 (Godel). If ZF is consistent, then so is the theory ZFC + 
(GCH). 


Gédel obtains this result using the universe of constructible sets L. 
The construction of L is similar to that of V inside a model U of ZF . 
Here is an outline of the construction. 


* One uses the formalization in U F ZF of the syntax and the 
satisfaction for structures (a; €} ,), for ain U. 

* One proves that there is a functional class D which to any set 
associates the set of its parameter definable subsets, that is, if 
ais aset, the set D(a) equals 

{b € P(a) | there aren € w, an £Le-formula ¢(Up,..., v,) and 
Cy,++5Cy € awith b = {cy Ea | (a3 Ef gy) F G[eo, C1, --s Cyl}. 
By convention, one sets D(@) := P(B) = {}. 


¢ Using induction, one defines Ly := 9, Ly41, := D(Lg), and 


Les eee Lj for A a limit ordinal. Finally, one sets ‘L = 
U. Lq’, that is, L is defined by the formula L(x) = da(Ord(a) A 
x EL). 


* As for the von Neumann hierarchy, one may establish certain 
basic properties, for instance, a < 8 > Lg € Lg and the tran- 
sitivity of the L,. In this way, one gets a continuous hierarchy 
of transitive sets (Ly) geora: 


* One defines a rank rk, as in the case of V, and one may prove 
that rk, (a) = a, that is, L; NM Ord = a. In particular, Lis a 
proper class. 


¢ The proof that L F ZF is very similar to the one for V. It also 
follows that L has the same ordinals as U. 


¢ One proves that in U, L- = L holds. In other words, L satisfies 
the Axiom of Constructibility Vx L(x). To prove this, one estab- 
lishes that the functional class D is absolute for any transitive 
class X which is a model of ZF with the Power Set Axiom taken 
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away, and then that the same absoluteness statement holds for 
the functional class a b Ly. 


¢ In order to prove that L F (AC), one uses transfinite recursion 
to construct a well-ordering on Ly, such that ifa < 6, then Ly 
is an initial segment of Le. In the successor step, one uses the 
well-ordering Ly to construct a well-ordering on LE (the set 
of finite sequences in L,), then on Lg41, also enumerating the 
(codes) of £<-formulas. 


* Finally, L F (GCH) is established as follows. One proves first 
that, in any modelof ZF , one has card(L,) = card(c) for every 
infinite ordinal w. Then, one proves that, in L, one has L, = 
{x | card(tcl(x)) < x} for every infinite cardinal x. Thus, L F 
P(x) C L,+, and so L F (GCH). 


Remark. A weaker statement, namely that the consistency of ZF entails 
the consistency of ZFC, is the content of Exercise (6.7.6. 


Theorem 6.6.3 (Cohen). 


(1) If ZF is consistent, then so is ZFC + 7=(CH). 
(2) If ZF is consistent, then so is ZF + =(AC). 


In particular, combining the results by Gédel and Cohen, one gets 
the following result concerning the cardinality of the continuum. 


Corollary 6.6.4. If ZF is consistent, the Continuum Hypothesis (CH) is 
independent of ZFC. 


Cantor had formulated (CH) and thus raised the problem of the car- 
dinality of the continuum in 1878. It was then put forward by Hilbert 
as the first problem of his list of 23 important problems in mathematics 
which he presented at the International Congress of Mathematicians in 
Paris in 1900. 


Contrary to the constructions of models of set theory we have seen 
so far (and to the ones which will be treated in the exercises), Cohen’s 
construction is not done inside the ground model U. The method Cohen 
introduced in 1964 to prove his results, termed forcing, allows for con- 
structions of models which extend a given model. 
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Weakenings of the Axiom of Choice. The following two variants of 
the Axiom of Choice are often considered. 


Axiom of Dependent Choice (DC). 


WrVaVxo| (x9 Ea Ar Ca* AVx Eady €a(x,y) Er) 
> Af(f :@ > aA f(0) = x9 AVN E w((f(n), f(n + 1)) €r))). 
Axiom of Countable Choice (CC). 


Vf[(Fn(f) A dom(f) = 0 AG ¢ im(f)) 
> Ag (Fn(g) A dom(g) = w A Vx(x € dom(g) > g(x) € f(x)))I- 


It expresses that the product of a countable family of non-empty sets 
is non-empty. 


Proposition 6.6.5. One has (AC) > (DC) > (CC). 


Proof. (AC)>(DC) Let h : P'(a) > abea choice function on a. By 
induction on w, define a function f : w > a, setting f(0) := Xo, f(n+ 
1) := hy €a| Cf), y) € r}). Clearly, f is as required. 

(DC)>(CC) Let X,)new be a countable family of non-empty sets. 
Set Y, :={n}x X, anda:=l)_, Y,, and let 


new 
r :={(x, y) € a* | there exists n € w such that (x,y) € Y, X Yaar}: 
By (DC) there exists a function f : w — asuch that f(n) € Y, for 


alln € w. Theng € Tie X,, where g(n) := a((f(n)), with z the 
projection onto the second coordinate. 


The usual construction of a set of real numbers which is not Lebesgue 
measurable may be done in ZFC. It proves the following: 


Proposition 6.6.6. 
ZFC E ‘there exists a subset of R which is not Lebesgue measurable’ 
Let us mention a theorem which is more difficult (its proof uses forc- 
ing). 


Theorem 6.6.7 (Solovay, 1970). IfZFC+(IC) is consistent, then the theory 
ZF + (DC) + ‘very subset of R is Lebesgue measurable’ is consistent, too. 
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6.7. Exercises 


Exercise 6.7.1. We work in Uf F ZF , and we suppose that the sets 
N, Z,... are defined in the usual manner. 


(1) Prove that the sets N, Z, R, C, C°({0, 1], C) are in V,.». 
(2) Compute the ranks of the sets in (1). 


Exercise 6.7.2 (Mostowski Collapse). We work in UF ZF . Let X bea 
class and R C X x X a definable relation on X. 


We say R is set-like if for any a in X, the class pred,(a) of 
R-predecessors of a, defined by R(x, a), is given by a set; R is well-founded 
if any non-empty subset a of X contains an R-minimal element; finally, 
R is extensional if pred,(a) = pred,(b) implies a = b. 


(1) Assume that R C XxX is set-like. By induction on a, forx € X, 
define the relation rkp(x) > a as follows: 
° rkp(x) > 0 for all x € X; 
* rkp(x) > a +1 if and only if there is y € pred,(x) such 
that rkp(y) > a; 
¢ ifAisa limit, then rkp(x) > Aifand only ifrkp(x) > a for 
alla <A. 

Set rkp(x) := 00 if rkp(x) > a for all a; otherwise, rkp(x) 
is defined as the least a such that rkp(x) # a+1. (rkp is called 
the foundation rank.) 

Prove that R is well-founded if and only ifrkp takes ordinal 
values on X. 


(2) Assume now that R is well-founded and set-like. 

(a) Prove that if F is a functional class with Domain X and 
Image contained in Ord such that R(a, b) implies F(a) < 
F(b), then rkp(x) < F(x) for all x € X. 

(b) Prove that the Image of rkp is either an ordinal or the 
whole of Ord. 

(c) Prove that there is a unique functional class z with 
Dom(z) = X such that for every a in X one has z(a) = 
z[pred,(a)]. This z is called the Mostowski Collapse of 
(X, R). Moreover, prove that Im(z) is a transitive subclass 
of V. 
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(3) Suppose that R is well-founded, set-like and extensional, and 
let zx be the Mostowski Collapse. Prove that z induces a naive 
isomorphism between (X, R) and (Im(7), € htmcz)): 


(4) Suppose that X is a proper class and that R defines a set-like 
well-ordering (that is, a total order which is well-founded) on 
X. Observe that R is extensional. Prove that Im(z) = Ord, that 
is, z induces a naive order isomorphism between (X, R) and 
(Ord, <). 

Example: Let X = Ord<® be the class of finite sequences of 
ordinals. For s = (So, ...,S,_,) and t = (tg,... t—1) from Ord~® 
set R(s, t) if and only if 

* max(s) < max(t), or 

* max(s) = max(ft) andn < m, or 

* max(s) = max(t),n =m,s#tands; < t,, where k is the 

smallest index where s and ¢ differ. 

Prove that R is a set-like well-ordering on X. 


Exercise 6.7.3 (Reflection Principle). We work insome UF ZF . 


(1) Aclass of ordinals X C Ord is called CLUB if it is closed (that 
is, for every subset x C X, one has sup x € X) and unbounded 
in Ord. 

Observe that the intersection of two CLUB classes is CLUB. 

(2) Let (Wa) ezeora be a continuous hierarchy of sets, that is, a func- 
tional class W : Ord > U, W(a) = W,, such that 

* Wa CW, for alla < § and 
© WM =U, WM for any limit ordinal 2. 


Let W be the ‘union’ of the Wy, that is, the class defined by 
the formula da x € W,. Given an £--formula ¢(x1,...,xX,), we 
say that Wy reflects 9 if 


EVE a0 X, (A x EW > (94D oeco)] 
i=1 


(a) Observe that the ordinals a such that Wy reflects g form a 
class. 
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(b) Prove the following General Reflection Principle: For ev- 
ery formula g(x), the class of ordinals such that W, re- 
flects g contains a CLUB. 


[Hint: Proceed by induction on the height of ¢.] 

(c) Deduce the usual Reflection Principle: For any formula 
(x), one has 
ZF F Vyda(y € % AVX (9(x) o p’@(x))). 


Exercise 6.7.4 (Independence of the Replacement Axiom Scheme). The 
Zermelo axioms Z are given by the axioms of ZF without the Replacement 
scheme. We work in UF ZF. 


(1) Prove that if 2 > w is a limit ordinal, then VY F Z. 
(2) Prove that V2 does not satisfy the Replacement scheme. 
[Hint: Consider the functional class sending a to w + a.] 


(3) Deduce that if Z is consistent, then it does not imply the Re- 
placement scheme. Similarly, prove that if Z+(AC) is consis- 
tent, then it does not imply the Replacement scheme. 


Exercise 6.7.5 (Non-finite axiomatizability of ZF). The aim of this ex- 
ercise is to prove that if T D ZF is consistent, then T is not finitely ax- 
iomatizable. 


(1) Let UF ZF, and let_X C U be a transitive class which is also a 
model of ZF. Prove that for any ordinal 6 € X one has vi a 


74 aX, where vs denotes ‘Vg computed in x’. 

(2) Let U F gy, where ¢ is a sentence such that g + ZF. Using the 
Reflection Principle (cf. Exercise prove that there is a in 
U such that VY F @. 

(3) Conclude. 

(4) Prove that if ZF is consistent, then it is not finitely aximatiz- 
able. 


(5) Prove the following sharpening: If T D ZF is consistent, then 
there is no sentence ¢ such that y together with the Compre- 
hension scheme axiomatizes T. 
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Exercise 6.7.6 (Relative consistency of (AC)). We work in U F ZF. In 
this exercise, we assume that the set of £<-formulas is coded in w. 


(1) The class of ordinal definable sets is given by the £--formula 
OD(x) which expresses the following: ‘There exists a, a 
formula (Z,y¥o,..-,¥n-1) and a sequence (@p,...,a,_1) 
of ordinals < a such that x is the only element of V, with VY, F 
Q(X, Ag, An—1]- 

(a) Prove that if there are a (naive) £<-formula ®(x) with pa- 
rameters from Ord such that a is defined by ®(x), that is, 
®[U] = {a}, then a isin OD. 

[Hint: Use the Reflection Principle (Exercise 6.7.3). ] 


(b) Prove that there is a (naive) £--formula Pop (x, y) with- 
out parameters such that a set a is in OD if and only if 
there is an ordinal a such that a is defined by Pop (x, a). 


[Hint: Use that any #¢ is an ordinal and that any finite 
sequence of ordinals may be coded by an ordinal (Exer- 
cise 6.7.2).] 

(c) Prove that the class of formulas in one free variable 
with parameters from Ord may be well-ordered (cf. Exer- 
cise by an Q-definable set-like relation. Deduce the 
same for the class OD. 


(2) The class of hereditarily ordinal definable sets is given by the 
formula HOD(x) which expresses that every element of tcl({x}) 
isin OD. 

(a) Prove that all ordinals as well as V,, are in HOD. 

(b) Prove that HOD is an @-definable transitive class, and that 
a set a is in HOD if and only if it is in OD and all its ele- 
ments are in HOD. 

(c) Prove that HOD F ZF. 

(d) Prove that HOD F(AC). 

(e) Conclude that if ZF is consistent, then ZFC is consistent. 

(3) (a) Prove that VHOP = y,. 

(b) An £_-sentence is said to be arithmetical if all its quan- 
tifiers are relativized to V,,. Prove that if an arithmetical 
sentence is provable in ZFC, then it is provable in ZF. 
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Exercise 6.7.7 (Fraenkel-Mostowski models I - independence of (AF)). 
We work in (U;€) = UWE ZF . Let F bea functional class which defines 
a bijection between U and itself. Setx ©’ y :@ x € F(y) and U' = 
(U;€'). 

(1) Prove that Ul’ isa model of ZF which satisfies (AC) if U does. 


(2) Anatom isaset asuch that a = {a}. Prove that one may choose 
F such that UW’ contains an atom. If Uf F(AF), prove that there 
is F such that the set A of atoms in 2’ is in bijection with . 


(3) Prove thatif ZF (resp. ZFC” ) is consistent, then so is ZF + 
-=(AF) (resp. ZFC” + 7=(AF)). 


Exercise 6.7.8 (Fraenkel-Mostowski models II - relative consistency of 
-(AC)). We work in UF ZF and assume that U contains a non-empty 
set of atoms A. 


(1) By transfinite induction, define W = A, Wai, = P(W,), and 
W=U <a We for 4 a limit ordinal. Let W be the class defined 
by the formula da x € W,. 
(a) Prove that all W, are transitive sets and that one has a < 
p> W, CW. 
(b) Prove thatWFE ZF. 
(c) Prove that an element a € W is an atom if and only if 
acw. 
[Hint: One may use an appropriate notion of rank in W.] 
(d) Prove that W satisfies 


Va(a # @ > Ab Ea(b= {b} vV¥x E€a(x €b))). 
(e) Prove that UF Vx(W (x) — W(x)). 


Now assume that the class of atoms of 2 is given by the set A which 
is in bijection with w, and that UF VxW(x). (Note that if ZF is consis- 
tent, such a model of ZF exists by the first part, combined with Exer- 


cise 6.7.71) 


(2) Let 09 be a permutation of W = A. 
(a) Prove that for any a, o9 extends uniquely to an automor- 
phism o, of (Wz, €}w,,). Moreover, prove that if B < a, 
then o, wa= Og. 
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(b) Deduce that there is a (unique) functional class o which 
extends oy and induces a (naive) automorphism of U. 

(c) Let ®(x,...,x,) be an £--formula without parameters. 
Prove that 


UE Vx... VX,» [P(X], ..., Xp)  B((X),..., 7(Xp,))].- 
(d) Prove that o(a@) = a for every ordinal a. 


(3) The class of ordinal and atom definable sets is given by the 
£_-formula OAD(x) which expresses the following: “There 
exists a formula 9(Z, yo,.--,¥n—15Z0.-+»Zm—1), an ordinal a, 
a sequence (@p,...,@,-1) Of ordinals < a and a sequence 
(dg, ..-,Am_1) of atoms such that x is the only element of W, 
with W, F 9[x, a, .-,&n—1, Ags» An_1)- 

(a) Prove that if there is a (naive) £--formula ®(x) with pa- 
rameters from Ord and A which defines the set c, then c 
is in OAD. 

[Hint: For this and what follows, proceed as in Exer- 
cise 6.7.4] 

(b) Prove that there is a (naive) £--formula ®oap(x, y, Z) 
such that a set c is in OAD if and only if there is an ordinal 
a and a finite sequence of atoms s such that c is defined 
by Doapd(X, Qa, S). 

(c) Let c € A? be a total order on A. Prove that c is not in 
OAD. 


(4) The class of hereditarily ordinal and atom definable sets is given 
by the formula HOAD(x) which expresses that every element 
of tcl({x}) is in OAD. 

(a) Prove that A and every ordinal is in HOAD. 

(b) Prove that HOAD F ZF +-(AC)+-(AF). 

(c) Conclude that if ZF” is consistent, so is the theory ZF + 
=(AC)+7(AF). 
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